Learn Roslyn Now: Part 13 Keeping track of syntax nodes with Syntax Annotations

It can be tricky to keep track of nodes when applying changes to syntax trees. Every time we “change” a tree, we’re really creating a copy of it with our changes applied. The moment we do that, any pieces of syntax we had references to earlier become invalid in the context of the new tree.

What does this mean in practice? It’s tough to keep track of syntax nodes when we change syntax trees.

A recent Stack Overflow question touched on this: how can we get the symbol for a class that we’ve just added to a document? We can create a new class declaration, but the moment we add it to the document, we lose track of the node. So how can we keep track of the class so we can get its symbol once we’ve added it to the document?

The answer: Use a SyntaxAnnotation

A SyntaxAnnotation is basically a piece of metadata we can attach to a piece of syntax. As we manipulate the tree, the annotation sticks with that piece of syntax, making it easy to find.


AdhocWorkspace workspace = new AdhocWorkspace();
Project project = workspace.AddProject("SampleProject", LanguageNames.CSharp);

//Attach a syntax annotation to the class declaration
var syntaxAnnotation = new SyntaxAnnotation();
var classDeclaration = SyntaxFactory.ClassDeclaration("MyClass")
    .WithAdditionalAnnotations(syntaxAnnotation);

var compilationUnit = SyntaxFactory.CompilationUnit().AddMembers(classDeclaration);
Document document = project.AddDocument("SampleDocument.cs", compilationUnit);
SemanticModel semanticModel = document.GetSemanticModelAsync().Result;

//Use the annotation on our original node to find the new class declaration
var changedClass = document.GetSyntaxRootAsync().Result
    .DescendantNodes().OfType<ClassDeclarationSyntax>()
    .Where(n => n.HasAnnotation(syntaxAnnotation)).Single();
var symbol = semanticModel.GetDeclaredSymbol(changedClass);

There are a couple of overloads available when creating a SyntaxAnnotation. We can specify Kind and Data to be attached to pieces of syntax. Data is used to attach extra information to a piece of syntax that we’d like to retrieve later. Kind is a field we can use to search for Syntax Annotations.

So instead of looking for the exact instance of our annotation on each node, we could search for annotations based on their kind:


AdhocWorkspace workspace = new AdhocWorkspace();
Project project = workspace.AddProject("Test", LanguageNames.CSharp);

string annotationKind = "SampleKind";
var syntaxAnnotation = new SyntaxAnnotation(annotationKind);
var classDeclaration = SyntaxFactory.ClassDeclaration("MyClass")
    .WithAdditionalAnnotations(syntaxAnnotation);

var compilationUnit = SyntaxFactory.CompilationUnit().AddMembers(classDeclaration);
Document document = project.AddDocument("Test.cs", compilationUnit);
SemanticModel semanticModel = await document.GetSemanticModelAsync();

//Just search for the Kind instead
var root = await document.GetSyntaxRootAsync();
var changedClass = root.GetAnnotatedNodes(annotationKind).Single();
var symbol = semanticModel.GetDeclaredSymbol(changedClass);
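
The Data overload works the same way. As a quick sketch (the kind and data strings here are made up), we can stash a value on a node and read it back later with GetAnnotations:


//Attach some extra information to the class declaration
var annotationWithData = new SyntaxAnnotation("SampleKind", "added-by-my-refactoring");
var annotatedClass = SyntaxFactory.ClassDeclaration("MyClass")
    .WithAdditionalAnnotations(annotationWithData);

//Later, pull the data back off the node
var data = annotatedClass.GetAnnotations("SampleKind").Single().Data; //"added-by-my-refactoring"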

This is just one of a few different ways of dealing with Roslyn’s immutable trees. It’s probably not the easiest to use if you’re making multiple changes and need to track multiple syntax nodes. (If that’s the case, I’d recommend the DocumentEditor.) That said, it’s good to be aware of it so you can use it when it makes sense.

Learn Roslyn Now: Part 12 Document Editing with the DocumentEditor

One drawback of Roslyn’s immutability is that it can sometimes make it tricky to apply multiple changes to a Document or SyntaxTree. Immutability means that every time we apply changes to a syntax tree, we’re given an entirely new syntax tree. By default we can’t compare nodes across trees, so what do we do when we want to make multiple changes to a syntax tree?

Roslyn gives us a few options for dealing with this. In this post, we’ll focus on the DocumentEditor.

The DocumentEditor allows us to make multiple changes to a document and get the resulting document after the changes have been applied. Under the covers, the DocumentEditor is a thin layer over the SyntaxEditor.

We’ll use the DocumentEditor to change:


char key = Console.ReadKey().KeyChar;
if (key == 'A')
{
    Console.WriteLine("You pressed A");
}
else
{
    Console.WriteLine("You didn't press A");
}


to:


char key = Console.ReadKey().KeyChar;
if (key == 'A')
{
    LogConditionWasTrue();
    Console.WriteLine("You pressed A");
}
else
{
    Console.WriteLine("You didn't press A");
    LogConditionWasFalse();
}


We’ll use the DocumentEditor to simultaneously insert one invocation before the first Console.WriteLine() and another after the second.

Unfortunately there’s a ton of boilerplate when creating a Document from scratch. Typically you’ll get a Document from a Workspace, so it usually won’t be this bad:


var mscorlib = MetadataReference.CreateFromAssembly(typeof(object).Assembly);
var workspace = new AdhocWorkspace();
var projectId = ProjectId.CreateNewId();
var versionStamp = VersionStamp.Create();
var projectInfo = ProjectInfo.Create(projectId, versionStamp, "NewProject", "projName",
    LanguageNames.CSharp, metadataReferences: new[] { mscorlib });
var newProject = workspace.AddProject(projectInfo);
var sourceText = SourceText.From(@"
class C
{
    void M()
    {
        char key = Console.ReadKey().KeyChar;
        if (key == 'A')
        {
            Console.WriteLine(""You pressed A"");
        }
        else
        {
            Console.WriteLine(""You didn't press A"");
        }
    }
}");
var document = workspace.AddDocument(newProject.Id, "NewFile.cs", sourceText);
var syntaxRoot = await document.GetSyntaxRootAsync();
var ifStatement = syntaxRoot.DescendantNodes().OfType<IfStatementSyntax>().Single();

var conditionWasTrueInvocation =
    SyntaxFactory.ExpressionStatement(
        SyntaxFactory.InvocationExpression(SyntaxFactory.IdentifierName("LogConditionWasTrue"))
            .WithArgumentList(
                SyntaxFactory.ArgumentList()
                    .WithOpenParenToken(SyntaxFactory.Token(SyntaxKind.OpenParenToken))
                    .WithCloseParenToken(SyntaxFactory.Token(SyntaxKind.CloseParenToken))))
    .WithSemicolonToken(SyntaxFactory.Token(SyntaxKind.SemicolonToken));

var conditionWasFalseInvocation =
    SyntaxFactory.ExpressionStatement(
        SyntaxFactory.InvocationExpression(SyntaxFactory.IdentifierName("LogConditionWasFalse"))
            .WithArgumentList(
                SyntaxFactory.ArgumentList()
                    .WithOpenParenToken(SyntaxFactory.Token(SyntaxKind.OpenParenToken))
                    .WithCloseParenToken(SyntaxFactory.Token(SyntaxKind.CloseParenToken))))
    .WithSemicolonToken(SyntaxFactory.Token(SyntaxKind.SemicolonToken));

//Finally… create the document editor
var documentEditor = await DocumentEditor.CreateAsync(document);

//Insert LogConditionWasTrue() before the Console.WriteLine()
documentEditor.InsertBefore(ifStatement.Statement.ChildNodes().Single(), conditionWasTrueInvocation);

//Insert LogConditionWasFalse() after the Console.WriteLine()
documentEditor.InsertAfter(ifStatement.Else.Statement.ChildNodes().Single(), conditionWasFalseInvocation);

var newDocument = documentEditor.GetChangedDocument();

All the familiar editing operations are here. We can Insert, Replace and Remove nodes as we see fit, all based on nodes from our original syntax tree. Many people find this approach more intuitive than building an entire CSharpSyntaxRewriter.
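
For instance, replacing and removing nodes follows the same pattern. A quick sketch, reusing the ifStatement and documentEditor from the example above as a separate set of edits:


//Swap out the if-condition and drop the else-clause, all expressed against original nodes
documentEditor.ReplaceNode(ifStatement.Condition, SyntaxFactory.ParseExpression("key == 'B'"));
documentEditor.RemoveNode(ifStatement.Else);
var editedDocument = documentEditor.GetChangedDocument();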

It can be somewhat difficult to debug things when they go wrong. When writing this post I was mistakenly trying to insert nodes after ifStatement.Else instead of ifStatement.Else.Statement. I was receiving an InvalidOperationException but the message wasn’t very useful and it took me quite some time to figure out what I was doing wrong. The documentation on InsertNodeAfter says:

> This node must be of a compatible type to be placed in the same list containing the existing node.

How can we know which types of nodes are compatible with one another? I don’t think there’s a good answer here. We essentially have to learn which nodes are compatible ourselves. As usual the Syntax Visualizer and Roslyn Quoter are the best tools for figuring out what kinds of nodes you should be creating.

It’s worth noting that the DocumentEditor exposes the SemanticModel of your original document. You may need this when editing the original document and making decisions about what you’d like to change.

It’s also worth noting that the underlying SyntaxEditor exposes a SyntaxGenerator that you can use to build syntax nodes without relying on the more verbose SyntaxFactory.
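
Here’s a small sketch of both, again reusing documentEditor and ifStatement from the example above:


//The editor exposes the original document's semantic model...
var conditionType = documentEditor.SemanticModel.GetTypeInfo(ifStatement.Condition).Type;

//...and a SyntaxGenerator for building nodes with less ceremony than SyntaxFactory
var generator = documentEditor.Generator;
var logCall = generator.ExpressionStatement(
    generator.InvocationExpression(generator.IdentifierName("LogConditionWasTrue")));


Since SyntaxGenerator is language-agnostic, the same calls work whether the document is C# or Visual Basic.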

Learn Roslyn Now: Part 11 Introduction to Code Fixes

Last time (three months ago, jeez) we talked about building our first analyzer and what we get out of the box with the default analyzer template. Today we’ll talk about the second half of the analyzer project: The Code Fix Provider.

CodeFixProvider.cs

The first thing to notice is that our class inherits from CodeFixProvider. If you take a quick look at CodeFixProvider, you’ll see that it expects you to provide at least two things:

  • FixableDiagnosticIds – A list of diagnostic IDs that we would like our code fix to deal with. We would have defined these IDs in our original analyzer.
  • RegisterCodeFixesAsync – Registers our code fix within Visual Studio to handle our diagnostic(s).
  • GetFixAllProvider – An optional FixAllProvider that can apply your code fix to all the occurrences of a diagnostic.

Let’s take a look at the first half of this file:


[ExportCodeFixProvider(LanguageNames.CSharp, Name = nameof(Analyzer1CodeFixProvider)), Shared]
public class Analyzer1CodeFixProvider : CodeFixProvider
{
    private const string title = "Make uppercase";

    public sealed override ImmutableArray<string> FixableDiagnosticIds
    {
        get { return ImmutableArray.Create(Analyzer1Analyzer.DiagnosticId); }
    }

    public sealed override FixAllProvider GetFixAllProvider()
    {
        return WellKnownFixAllProviders.BatchFixer;
    }
}

First, we can see that we’re exporting a code fix provider for C# with the name “Analyzer1CodeFixProvider”. We can also specify additional languages such as VB if you’re writing a multi-language code fix. Note that we have to specify the name explicitly here. (Name is a property in ExportCodeFixProvider. I’d actually never come across this attribute-specific syntax before.)

To start, we’ve got the title of the code fix, which is self-explanatory. We’ll expose this title to Visual Studio when we register our code fix action.

Next, we’ve got to expose a list of diagnostics for which we’d like to provide our code fix. In this case, we expose the analyzer we created in the introduction to analyzers.

Finally, the default code fix template overrides the optional GetFixAllProvider. In this case it provides a BatchFixer. The BatchFixer computes all the required changes in parallel and then applies them to the solution at one time.

Now we’ll take a look at the last two methods given to us in CodeFixProvider.cs


[ExportCodeFixProvider(LanguageNames.CSharp, Name = nameof(Analyzer1CodeFixProvider)), Shared]
public class Analyzer1CodeFixProvider : CodeFixProvider
{
    public sealed override async Task RegisterCodeFixesAsync(CodeFixContext context)
    {
        var root = await context.Document.GetSyntaxRootAsync(context.CancellationToken).ConfigureAwait(false);

        // TODO: Replace the following code with your own analysis, generating a CodeAction for each fix to suggest
        var diagnostic = context.Diagnostics.First();
        var diagnosticSpan = diagnostic.Location.SourceSpan;

        // Find the type declaration identified by the diagnostic.
        var declaration = root.FindToken(diagnosticSpan.Start).Parent.AncestorsAndSelf().OfType<TypeDeclarationSyntax>().First();

        // Register a code action that will invoke the fix.
        context.RegisterCodeFix(
            CodeAction.Create(
                title: title,
                createChangedSolution: c => MakeUppercaseAsync(context.Document, declaration, c),
                equivalenceKey: title),
            diagnostic);
    }

    private async Task<Solution> MakeUppercaseAsync(Document document, TypeDeclarationSyntax typeDecl, CancellationToken cancellationToken)
    {
        // Compute new uppercase name.
        var identifierToken = typeDecl.Identifier;
        var newName = identifierToken.Text.ToUpperInvariant();

        // Get the symbol representing the type to be renamed.
        var semanticModel = await document.GetSemanticModelAsync(cancellationToken);
        var typeSymbol = semanticModel.GetDeclaredSymbol(typeDecl, cancellationToken);

        // Produce a new solution that has all references to that type renamed, including the declaration.
        var originalSolution = document.Project.Solution;
        var optionSet = originalSolution.Workspace.Options;
        var newSolution = await Renamer.RenameSymbolAsync(document.Project.Solution, typeSymbol, newName, optionSet, cancellationToken).ConfigureAwait(false);

        // Return the new solution with the now-uppercase type name.
        return newSolution;
    }
}

The first is RegisterCodeFixesAsync, and it accepts a CodeFixContext. The CodeFixContext has information about where we can apply our code fix and what diagnostics are available for us to register our code fix against. CodeFixContext provides a list of diagnostics for us to choose from based on what we exposed in FixableDiagnosticIds.

Based on my experiments, RegisterCodeFixesAsync is run every time the Visual Studio light bulb appears for a diagnostic we’ve declared interest in. At this point we can register an action to run if the user selects our code fix. We do this with context.RegisterCodeFix(), passing in a title, a function that returns a solution with our change applied, and an optional equivalence key. The title is simply what will be displayed to the user when they see our fix as an option. In the default template it’s “Make uppercase”, which you can see below:

[Screenshot: the “Make uppercase” fix shown in Visual Studio’s light bulb menu]

Clicking on the code fix runs MakeUppercaseAsync. There’s admittedly a lot of overhead here for what seems like a trivial change. The real work occurs in Renamer.RenameSymbolAsync(), an API that quickly and easily renames symbols for us across an entire solution. Remember that Roslyn objects are immutable, so we’re given an entirely new solution (newSolution), which we return from our method. Visual Studio will then replace the previous solution with our updated copy.

One final note regarding equivalenceKey: it’s used to match our code fix against other code fixes and determine whether or not they’re the same. To my knowledge, there’s no commonly agreed-upon format for these keys. However, it looks like projects such as StyleCopAnalyzers are taking a similar approach to Microsoft and naming theirs with a two-letter code followed by a number (e.g. SA1510CodeFixProvider).

And there you have it. That’s the basic analyzer and code fix that ship with the default template. Obviously we can build much more powerful analyzers and code fixes, but this project should serve as a nice starting point for most people. For more advanced analyzers check out StyleCopAnalyzers, Code Cracker or the Roslyn Analyzers.

Alive Beta Released!

Today we’re pleased to release the Alive Beta. This is our biggest release since our initial alpha launch one and a half months ago. The graduation from alpha to beta is more than just a name change and includes a bunch of new features and bugfixes.

Overhauled UI

Alive’s original UI did a poor job of telling you what was going on. We’ve rebuilt it to show you explicitly what Alive is doing: it’s now clear when Alive is working, when it has encountered an error and when execution has succeeded.

Keyboard Shortcuts

First posted on our public issue tracker by RaffaelG, this feature has been long overdue. Now you can launch Alive directly from the keyboard by pressing: Ctrl-[, Ctrl-[

Expanded Project Support

We’ve been trying hard to support more project types as we try to reach parity with Visual Studio. If Visual Studio can load and run it, we think Alive should be able to as well.

It’s taken a lot of work, but we’ve made progress on ASP .Net 5 projects. The Alive beta now supports DNX projects that target the full framework. We’re working hard to support .Net Core 5 as well, but it’s been difficult with a lot of this stuff being undocumented and in beta. On the bright side, we plan to have support for .Net Core 5 in our v1.0 release.

Misc Features and Bug Fixes

As usual, we’ve included a number of bug fixes and miscellaneous improvements.

  • You can now invoke static generic methods directly
  • You can now pass any parameters to static methods
  • Improved method tracking

The Full Release Notes may be found on our issue tracker.

If you’re interested in using Alive, you can purchase a license from our website.

You can update your copy of Alive from within Visual Studio or from the Visual Studio Extension Gallery.

LRN Quick Tip: Bridging Visual Studio and Roslyn

Today I was working on a sample Visual Studio extension and I had the following question:

Given a Visual Studio ITextSnapshot or ITextBuffer how can I get the corresponding Roslyn Document?

It turns out there are a bunch of extension methods that make this easy. They’re not shipped with the Microsoft.CodeAnalysis NuGet package, so you’ll need to pull them down manually:

  1. Install the EditorFeatures NuGet package:
    Install-Package Microsoft.CodeAnalysis.EditorFeatures.Text
  2. Include the appropriate using statement:
    using Microsoft.CodeAnalysis.Text;

Now you’ll be able to map from ITextBuffer and ITextSnapshot back to Roslyn’s Document, SourceText and Workspace objects.
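
As a rough sketch (snapshot here stands in for whatever ITextSnapshot your extension already has; an ITextBuffer gets there via its CurrentSnapshot):


//Map from the editor's snapshot back into the Roslyn world
Document document = snapshot.GetOpenDocumentInCurrentContextWithChanges();
SourceText sourceText = snapshot.AsText();
Workspace workspace = document?.Project.Solution.Workspace;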

Introducing Alive

tl;dr: We built this:

[Animation: Alive displaying live execution results inline in the Visual Studio editor]

Write code. Compile. Run.

To most of us, this is what programming “is”. But when we really stop to think about it, this is a pretty bizarre way to create. We’ve divided up programming into these separate worlds of “design-time” and “debug-time”.

At design-time we make our changes, we imagine what the computer will do and start the program up. At “debug-time” we test our theories and see if we were right.

It’s like painting with a blindfold on. Once blindfolded, we make our marks and do our best to imagine how we’d like our painting to look. Occasionally we remove the blindfold to see how badly we’ve done. Then we get to put it back on and try to correct our mistakes.

This seemed backwards to us. We wanted a frictionless design loop. One where we gain an instant connection to the programs we create.

So we built a tool to do this.

What is it?

Alive is a live programming environment for Visual Studio that currently supports C#. It blurs the line between “design-time” and “run-time”. As you write your code, you’re shown exactly what the code will do at runtime. It responds instantly as you update variables and manipulate code.

In a sense, Alive lets you travel through time. You can rewind and fast forward through loops. You can jump up and down the callstack of a recursive method call. This is fantastic for quickly tracking down bugs and edge cases.

See Alive in action:

What can it do?

Alive really shines when you’ve written unit tests and can run any code touched by one of your tests. You simply select the test you’d like to run against your method and watch as the values pop into existence. This is a lifesaver when trying to understand or modify code written by someone else.

Alive can also run any static method if you’re willing to provide parameters for it. This is useful when exploring and playing around with small programs. Essentially Alive can act as a replacement for a REPL and allow you to iterate and experiment quickly.

What can it not do?

It’s important to note the areas that Alive was not built to support. Alive isn’t very good at working with long-running code. If you’re backtesting stock data, Alive probably isn’t the tool you’re looking for. This is because we occasionally re-run your code: we’re bound to the speed of your code and can’t show you the results until your code is finished running. (We’re working on providing partial results to you as you type. Unfortunately, this functionality won’t make it into the June alpha.)

Alive should be used with caution when running code with real-world side effects. If you’re constantly adding items to your database while using Alive, you should expect to see a few extra items in there. We’re not running your code on every keystroke, but we are running it a handful of times as you type.

Alive is not great for running against non-deterministic code. If your unit tests periodically don’t run certain pieces of code, it can be difficult for us to know which unit tests correspond to which methods. You can work around this issue by selecting the unit tests you know will hit a given method.

What’s next?

Our next major milestone will be the alpha release on June 1. We’re currently testing Alive against various unit test frameworks including XUnit, NUnit and MSTest. We’re still working to solve some bugs and improve performance on large projects.

To save 30% on a license, you can pre-order at https://comealive.io/#preorder. The license will be valid for one year after the release of v1.0. You may use Alive after the license expires, but you will no longer receive updates and bugfixes for Alive.

For updates on Alive subscribe to our newsletter at https://comealive.io/#newsletter

Learn Roslyn Now: Part 10 Introduction to Analyzers

Roslyn analyzers allow companies and individuals to enforce certain rules within a code base. My understanding is that there are two primary uses for analyzers:

  • Broadly enforce coding styles and best practices
  • Specifically guide individuals consuming a library

The first use is largely a replacement for tools like StyleCop and FxCop. We can use analyzers to enforce stylistic choices like “All private variables must start with a lowercase letter” and “Use spaces not tabs”. In fact, you can start using StyleCop.Analyzers today. From a NuGet command line simply use:

Install-Package StyleCop.Analyzers -Pre

The second use is to release library-specific analyzers meant to guide consumers of your library. For example, we might want to ensure that no one does the following:


var dateTime = System.DateTime.UtcNow;
dateTime.AddDays(1);

System.DateTime is immutable, so the above code is misleading. Instead the user should have written the following:


var dateTime = System.DateTime.UtcNow;
dateTime = dateTime.AddDays(1);


Analyzers allow library authors to help guide their users. In that sense, I hope that it becomes standard to release a set of analyzers alongside new libraries. It’s difficult to say if this will actually happen, as it requires extra work from library authors.

Download the Roslyn SDK Templates

The templates do not ship with Visual Studio 2015. To install them go to:

Tools > Extensions and Updates > Online.

Search for “Roslyn SDK” and find the templates that correspond to your version. I’m using Visual Studio 2015 RC. I’ve chosen the package selected below:

[Screenshot: the Roslyn SDK templates extension selected in Extensions and Updates]

After installing the templates, you must restart Visual Studio.

Creating your first analyzer

Navigate to:

File > New Project > Extensibility > Analyzer with Code Fix

[Screenshot: the New Project dialog with the “Analyzer with Code Fix” template selected]
Give your analyzer a name and click “OK”. I’ve taken the creative liberty of naming mine "Analyzer1". From here we’re presented with a README that explains that building our project creates both a .vsix for Visual Studio and a .nupkg for submission to NuGet. There are also instructions on how to properly distribute your analyzer as a NuGet package.

Let’s take a look at what we’re given right out of the box:

[Screenshot: Solution Explorer showing the three generated projects]

We’re given three projects:

  • Analyzer1 – The brain of our analyzer. This is where all code analysis is done and code fixes are figured out.
  • Analyzer1.Test – A default test project with some helper classes to make testing easier.
  • Analyzer1.Vsix – The startup project that will be deployed to Visual Studio. The .vsixmanifest tells Visual Studio that you’d like to export an analyzer and a code fix.

To run the project, simply press F5. A new instance of Visual Studio will launch. This Visual Studio is called the Experimental Hive and has its own set of settings within the Windows Registry. Note: It’s a good practice to choose a different theme for your Experimental Hive so you don’t get them mixed up.

Once you open a solution, you’ll notice Visual Studio complaining about a lot of new warnings. The analyzer we’re running simply creates a warning when it sees any type with lowercase letters in its name. It’s obviously not very useful, but it lets us demonstrate the code fix included in this sample.

Now that we’ve got a rough idea of what each project is for, we’ll explore Analyzer1 and what we’re given for free.

DiagnosticAnalyzer.cs

The first thing to notice is that our analyzer inherits from the abstract class DiagnosticAnalyzer. This class expects us to do two things: describe the diagnostics we can report (SupportedDiagnostics) and register the actions we’d like to run during analysis (Initialize).

Let’s take a look at the properties and fields in the first half of the file:


[DiagnosticAnalyzer(LanguageNames.CSharp)]
public class Analyzer1Analyzer : DiagnosticAnalyzer
{
    public const string DiagnosticId = "Analyzer1";

    // You can change these strings in the Resources.resx file. If you do not want your analyzer to be localize-able, you can use regular strings for Title and MessageFormat.
    internal static readonly LocalizableString Title = new LocalizableResourceString(nameof(Resources.AnalyzerTitle), Resources.ResourceManager, typeof(Resources));
    internal static readonly LocalizableString MessageFormat = new LocalizableResourceString(nameof(Resources.AnalyzerMessageFormat), Resources.ResourceManager, typeof(Resources));
    internal static readonly LocalizableString Description = new LocalizableResourceString(nameof(Resources.AnalyzerDescription), Resources.ResourceManager, typeof(Resources));
    internal const string Category = "Naming";

    internal static DiagnosticDescriptor Rule = new DiagnosticDescriptor(DiagnosticId, Title, MessageFormat, Category, DiagnosticSeverity.Warning, isEnabledByDefault: true, description: Description);

    public override ImmutableArray<DiagnosticDescriptor> SupportedDiagnostics { get { return ImmutableArray.Create(Rule); } }
}

It may seem overwhelming at first, but bear with me. First notice the DiagnosticAnalyzer attribute applied to the class. This specifies what language or languages our analyzer will be run on. Today, you can only specify C# and VB .Net.
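
For example, if your analyzer also handled Visual Basic, my understanding is the attribute would look like this:


[DiagnosticAnalyzer(LanguageNames.CSharp, LanguageNames.VisualBasic)]
public class Analyzer1Analyzer : DiagnosticAnalyzer
{
    //...
}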

Looking within the class, the first five fields are simply strings used to describe our analyzer and provide messages to users. By default, the analyzer is set up to encourage localization and lets you define your title, message format and description as localizable strings. However, if localization scares you like it does me, you can make them plain strings.
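
If we went the plain-string route, the top of the class might look something like this (a sketch with made-up message text):


internal const string Title = "Type name contains lowercase letters";
internal const string MessageFormat = "Type name '{0}' contains lowercase letters";
internal const string Description = "Type names should be all uppercase.";


Plain strings work here because DiagnosticDescriptor has an overload that accepts them directly.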

Take a moment to look at the DiagnosticDescriptor Rule. It defines a DiagnosticSeverity of Warning. I suspect you’ll likely want to stick with Warning, but if you feel like imposing on consumers of your analyzer, you could upgrade the severity to Error and prevent compilation completely. Note: I don’t recommend this. If your analyzer misbehaves and reports errors where there are none, the user will remove it.

Finally, let’s take a look at the two generated methods:


[DiagnosticAnalyzer(LanguageNames.CSharp)]
public class Analyzer1Analyzer : DiagnosticAnalyzer
{
    public override void Initialize(AnalysisContext context)
    {
        // TODO: Consider registering other actions that act on syntax instead of or in addition to symbols
        context.RegisterSymbolAction(AnalyzeSymbol, SymbolKind.NamedType);
    }

    private static void AnalyzeSymbol(SymbolAnalysisContext context)
    {
        // TODO: Replace the following code with your own analysis, generating Diagnostic objects for any issues you find
        var namedTypeSymbol = (INamedTypeSymbol)context.Symbol;

        // Find just those named type symbols with names containing lowercase letters.
        if (namedTypeSymbol.Name.ToCharArray().Any(char.IsLower))
        {
            // For all such symbols, produce a diagnostic.
            var diagnostic = Diagnostic.Create(Rule, namedTypeSymbol.Locations[0], namedTypeSymbol.Name);
            context.ReportDiagnostic(diagnostic);
        }
    }
}


The Initialize() method sets up the analyzer by registering the AnalyzeSymbol method to fire when semantic analysis has been run on a NamedType symbol. This is only one of a handful of ways to trigger an analyzer: we can register our analyzer to run on various triggers including compilation, analysis of code blocks and analysis of syntax trees. We’ll flesh out AnalysisContext in future posts.
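
For example, an analyzer that cares about syntax rather than symbols could register a syntax node action instead. A minimal sketch, where AnalyzeIfStatement is a hypothetical method you’d write yourself:


public override void Initialize(AnalysisContext context)
{
    //Fire AnalyzeIfStatement whenever an if-statement is analyzed
    context.RegisterSyntaxNodeAction(AnalyzeIfStatement, SyntaxKind.IfStatement);
}

private static void AnalyzeIfStatement(SyntaxNodeAnalysisContext context)
{
    var ifStatement = (IfStatementSyntax)context.Node;
    //...inspect the node and report diagnostics here
}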

The AnalyzeSymbol() method is where we actually do the analysis we’ve been talking about. This is where we would use the Syntax Tree and Symbol APIs to diagnose and report issues. In the case of this analyzer, it simply takes the INamedTypeSymbol provided and checks whether any of the characters in its name are lowercase. If they are, we report a diagnostic using the Rule we defined earlier.

This may seem like an awful lot of boilerplate for such a simple analyzer. However, once you start building complicated analyzers, you’ll find that the analysis code quickly starts to dominate and that the boilerplate isn’t so bad.

Next time, we’ll explore the CodeFixProvider and how we can offer solutions to problems we find in a user’s code.

LRN Quick Tip: PCL References and MSBuildWorkspace

2015/07/23 Edit: All of these problems should now be fixed in the latest Roslyn NuGet packages

We first looked at MSBuildWorkspace in Part 6 Working with Workspaces. MSBuildWorkspace works really well when loading up solutions from .sln files. It properly understands .csproj files, so we don’t have to worry about tracking down references, documents, or MSBuild targets.

However, when compiling solutions that contained Portable Class Libraries (PCLs), I kept running into frustrating problems with missing references to System.Runtime.dll. For example, I’d see a handful of errors like:

error CS0012: The type 'Object' is defined in an assembly that is not referenced.
You must add a reference to assembly 'System.Runtime, Version=4.0.0.0, Culture=neutral, PublicKeyToken=b03f5f7f11d50a3a'.

I’d also seen this issue pop up within Visual Studio when working on XAML and when debugging.

After learning some more about PCLs, it turns out that PCLs must reference facade assemblies that delegate the actual work to proper assemblies later. For the full story on PCLs and facade references see: http://alxandr.me/2014/07/20/the-problems-with-portable-class-libraries-and-the-road-to-solving-them/

In our case we needed to add a reference to System.Runtime.dll. I won’t bother to show the code for this as it’s fraught with its own set of problems. Although it resolves our System.Object reference, we quickly run into problems with other types such as System.Threading.Tasks.Task. Manually adding these references was definitely not going to scale.

The Workaround

Originally this problem was reported as a bug within MSBuildWorkspace. After all, why wouldn’t it resolve the references properly when we open the solution? Visual Studio obviously seemed to be capable of figuring the references out…

It turns out there’s an MSBuild property called CheckForSystemRuntimeDependency. If we set it to true, all our PCL worries seem to go away.

Simply create your MSBuildWorkspace with:


var ws = MSBuildWorkspace.Create(new Dictionary<string, string> { { "CheckForSystemRuntimeDependency", "true" } });

That’s it. That’s all. And best of all, the Roslyn team will be removing this property when Roslyn v1.0 ships.

More info may be found at: https://github.com/dotnet/roslyn/issues/560

Thanks to @Pilchie and @JasonMalinowski for helping out with this problem and for the workaround!

LRN Quick Tips: Don’t trust SyntaxNode.ToFullString()

At Code Connect we’ve been working hard on a project that rewrites a user’s code and then compiles the rewritten solution. Without giving too much away, it essentially acts as a logger and can intercept all variable assignments.

So the following code:


class MyClass
{
    void MyMethod()
    {
        int x = 5;
    }
}


Is rewritten to:


class MyClass
{
    void MyMethod()
    {
        int x = LogValue("x", 5);
    }

    public static T LogValue<T>(string name, T value)
    {
        //Write to file
        //
        return value;
    }
}


At some point we decided we no longer wanted to inject the LogValue method directly into the code we were instrumenting. We would place it in an entirely different namespace: CodeConnect.Instrumentation.

Here was how we originally created the invocation to LogValue:


var newNode = SyntaxFactory.InvocationExpression(
        SyntaxFactory.IdentifierName("LogValue"))
    .WithArgumentList(
        //
    );

So we figured we’d incorporate our new namespace as follows:


var newNode = SyntaxFactory.InvocationExpression(
        SyntaxFactory.IdentifierName("CodeConnect.Instrumentation.LogValue"))
    .WithArgumentList(
        //
    );

At first glance, everything checked out. Calling .ToFullString() on this node revealed what looked like a proper invocation:

CodeConnect.Instrumentation.LogValue("x", 5);

However, try as we might, we could not get our code to compile. We constantly got errors telling us the CodeConnect.Instrumentation type didn’t exist:

CS0103 at line 12 (character 19): The name 'CodeConnect.Instrumentation' does not exist in the current context.

We checked our references, we checked our rewritten solution and we checked the spelling at least a dozen times. Everything checked out. Eventually we did the classic “Which change broke this functionality?” walk through our Git history. (Note: If I had written more complete unit tests when I first wrote the rewriter, we would have caught this a lot quicker!)

We realized it was the change to the rewriter that introduced this problem. It took some time before we realized exactly what was wrong. Once again the ever-truthful Syntax Visualizer (now packaged with the .NET Compiler Platform SDK) came to the rescue and showed us what the syntax tree for a valid invocation of CodeConnect.Instrumentation.LogValue("x", 5); would look like:

[Syntax Visualizer: the tree for CodeConnect.Instrumentation.LogValue("x", 5), a chain of SimpleMemberAccessExpression nodes inside an InvocationExpression]

After looking over the above tree, we realized we were supposed to create a chain of SimpleMemberAccessExpressions, not a single InvocationExpression with the name “CodeConnect.Instrumentation.LogValue”. When the binder went to bind this invocation to a symbol, it failed, as no method can be declared with this name. The tree we had created was invalid and could never have been parsed out of a user’s source code.
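
For what it’s worth, the working construction builds the qualified name up explicitly as nested member accesses. A sketch of the shape the Syntax Visualizer pointed us toward:


//Build CodeConnect.Instrumentation.LogValue as a chain of member accesses
var memberAccess = SyntaxFactory.MemberAccessExpression(
    SyntaxKind.SimpleMemberAccessExpression,
    SyntaxFactory.MemberAccessExpression(
        SyntaxKind.SimpleMemberAccessExpression,
        SyntaxFactory.IdentifierName("CodeConnect"),
        SyntaxFactory.IdentifierName("Instrumentation")),
    SyntaxFactory.IdentifierName("LogValue"));

//Invoke it; the argument list is built the same way as before
var newNode = SyntaxFactory.InvocationExpression(memberAccess);


In cases like this, SyntaxFactory.ParseExpression and the Roslyn Quoter can save you from hand-assembling the nesting.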

This is a key point to understand when creating or rewriting syntax trees:

No one will stop you from creating invalid syntax trees.

Or from Kevin Pilch-Bisson on Stack Overflow:

> The Roslyn Syntax construction API does not guarantee that you can only build valid programs.

We can take this to its logical extreme and create trees that don’t make any sense. For example, we can create WhitespaceTrivia whose value is “public void M() { }”.


//Make some impossible non-whitespace trivia
var trivia = SyntaxFactory.SyntaxTrivia(SyntaxKind.WhitespaceTrivia,
    @"
public void M() {
}");

var node = SyntaxFactory.ClassDeclaration(" test")
    .WithOpenBraceToken(
        SyntaxFactory.Token(SyntaxKind.OpenBraceToken)
            .WithTrailingTrivia(trivia));

Console.WriteLine(node.ToFullString());

The above prints the following misleading string:

class test{
public void M() {

}}

The output is misleading as it causes the reader to make assumptions about how the syntax tree must be formed. In my experience the only true arbiter of truth when it comes to syntax trees is the Roslyn Syntax Visualizer. I’d love for it to be expanded to visualize in-memory trees while debugging.

The takeaways here:

  • Don’t trust .ToString() or .ToFullString()
  • Understand that you may accidentally generate invalid trees
  • Write tests

Learn Roslyn Now: Part 9 Control Flow Analysis

Control flow analysis is used to understand the various entry and exit points within a block of code and to answer questions about reachability. If we’re analyzing a method, we might be interested in all the points at which we can return out of the method. If we’re analyzing a for-loop, we might be interested in all the places we break or continue.

We trigger control flow analysis via an extension method on the SemanticModel. This returns an instance of ControlFlowAnalysis to us that exposes the following properties:

  • EntryPoints – The set of statements inside the region that are the destination of branches outside the region.
  • ExitPoints – The set of statements inside a region that jump to locations outside the region.
  • EndPointIsReachable – Indicates whether a region completes normally. Returns true if and only if the end of the last statement is reachable or the entire region contains no statements.
  • StartPointIsReachable – Indicates whether a region can begin normally.
  • ReturnStatements – The set of return statements within a region.
  • Succeeded – Returns true if and only if analysis was successful. Analysis can fail if the region does not properly span a single expression, a single statement, or a contiguous series of statements within the enclosing block.

Basic usage of the API:


var tree = CSharpSyntaxTree.ParseText(@"
class C
{
    void M()
    {
        for (int i = 0; i < 10; i++)
        {
            if (i == 3)
                continue;
            if (i == 8)
                break;
        }
    }
}
");
var Mscorlib = PortableExecutableReference.CreateFromAssembly(typeof(object).Assembly);
var compilation = CSharpCompilation.Create("MyCompilation",
    syntaxTrees: new[] { tree }, references: new[] { Mscorlib });
var model = compilation.GetSemanticModel(tree);

var firstFor = tree.GetRoot().DescendantNodes().OfType<ForStatementSyntax>().Single();
ControlFlowAnalysis result = model.AnalyzeControlFlow(firstFor.Statement);
Console.WriteLine(result.Succeeded);          //True
Console.WriteLine(result.ExitPoints.Count()); //2 – continue, and break

Alternatively, we can specify two statements and analyze the statements between the two. The following example demonstrates this and the usage of EntryPoints:


var tree = CSharpSyntaxTree.ParseText(@"
class C
{
    void M(int x)
    {
        L1: ;                // 1
        if (x == 0) goto L1; //firstIf
        if (x == 1) goto L2;
        if (x == 3) goto L3;
        L3: ;                //label3
        L2: ;                // 2
        if (x == 4) goto L3;
    }
}
");
var Mscorlib = PortableExecutableReference.CreateFromAssembly(typeof(object).Assembly);
var compilation = CSharpCompilation.Create("MyCompilation",
    syntaxTrees: new[] { tree }, references: new[] { Mscorlib });
var model = compilation.GetSemanticModel(tree);

//Choose the first and last statements of the region
var firstIf = tree.GetRoot().DescendantNodes().OfType<IfStatementSyntax>().First();
var label3 = tree.GetRoot().DescendantNodes().OfType<LabeledStatementSyntax>().Skip(1).Take(1).Single();

ControlFlowAnalysis result = model.AnalyzeControlFlow(firstIf, label3);
Console.WriteLine(result.EntryPoints.Count()); //1 – the label L3 is a candidate entry point within these statements
Console.WriteLine(result.ExitPoints.Count());  //2 – goto L1 and goto L2 are candidate exit points

In the above example, we see a possible entry point: the label L3. To the best of my knowledge, labels are the only possible entry points.

Finally, we’ll take a look at answering questions about reachability. In the following, neither the start point nor the end point is reachable:


var tree = CSharpSyntaxTree.ParseText(@"
class C
{
    void M(int x)
    {
        return;
        if (x == 0)                              //-+ Start is unreachable
            System.Console.WriteLine(""Hello""); // |
        L1: ;                                    //-+ End is unreachable
    }
}
");
var Mscorlib = PortableExecutableReference.CreateFromAssembly(typeof(object).Assembly);
var compilation = CSharpCompilation.Create("MyCompilation",
    syntaxTrees: new[] { tree }, references: new[] { Mscorlib });
var model = compilation.GetSemanticModel(tree);

//Choose the first and last statements of the region
var firstIf = tree.GetRoot().DescendantNodes().OfType<IfStatementSyntax>().Single();
var label1 = tree.GetRoot().DescendantNodes().OfType<LabeledStatementSyntax>().Single();

ControlFlowAnalysis result = model.AnalyzeControlFlow(firstIf, label1);
Console.WriteLine(result.StartPointIsReachable); //False
Console.WriteLine(result.EndPointIsReachable);   //False

Overall, the Control Flow API seems a lot more intuitive than the Data Flow Analysis API. It requires less knowledge of the C# specification and is straightforward to work with. At Code Connect, we’ve been using it when rewriting and logging methods. Although it looks like no one has experimented much with this API, I’m really interested to see what uses others will come up with.
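
For example, if you wanted to find every explicit return in a method before rewriting it, a sketch (reusing the tree and model variables from the snippets above) might look like this:


var method = tree.GetRoot().DescendantNodes().OfType<MethodDeclarationSyntax>().First();
ControlFlowAnalysis flow = model.AnalyzeControlFlow(method.Body);

//Each return statement inside the method body
foreach (var returnStatement in flow.ReturnStatements)
{
    Console.WriteLine(returnStatement.GetLocation().GetLineSpan());
}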