Author: Richard Seroter

  • Do you know the Microsoft Customer Advisory Teams? You should.

    For those who live and work with Microsoft application platform technologies, the Microsoft Customer Advisory Teams (CAT) are a great source of real-world info about products and technology.  These are the small, expert-level teams whose sole job is to make sure customers are successful with Microsoft technology.  Last month I had the pleasure of presenting to both the SQL CAT and Server AppFabric CAT teams about blogging and best practices and thought I’d throw a quick plug out for these groups here.

    First off, the SQL CAT team (dedicated website here) has a regular blog of best practices, and links to the best whitepapers for SQL admins, architects, and developers.  I’m not remotely a great SQL Server guy, but I love following this team’s work and picking up tidbits that make me slightly more dangerous at work.  If you actually need to engage these guys on a project, contact your Microsoft rep.

    As for the Windows Server AppFabric CAT team, they also have a team blog with great expert content.  This team, which contains the artists-formerly-known-as-BizTalk-Rangers, provides deep expertise on BizTalk Server, Windows Server AppFabric, WCF, WF, AppFabric Caching and StreamInsight.  You’ll find a great bunch of architects on this team including Tim Wieman, Mark Simms, Rama Ramani, Paolo Salvatori and more, all led by Suren Machiraju and the delightfully frantic Curt Peterson. They’ve recently produced posts about using BizTalk with the AppFabric Service Bus, material on the Entity Framework,  and a ridiculously big and meaty post from Mark Simms about building StreamInsight apps.

    I highly recommend subscribing to both these team blogs and following SQL CAT on twitter (@sqlcat).


  • How Intelligent is BizTalk 2010’s Intelligent Mapper?

    One of the interesting new features of the BizTalk Server 2010 Mapper (and corresponding Windows Workflow shape) is the “suggestive matching” which helps the XSLT map author figure out which source (or destination) nodes are most likely related.  The MSDN page for suggestive matching has some background material on the feature.  I thought I’d run a couple quick tests to see just how smart this new mapper is.

    Before the suggestive match feature was introduced, we could do bulk mapping through the “link by” feature.  With that feature, you could connect two parent nodes and choose to map the child nodes based on structure (order), exact names, or through the mass copy function.  However, this is a fairly coarse way to map that doesn’t take into account the real semantic differences in a map.  It also doesn’t help you find any better destination candidates that may sit in a different section of the schema.

    2010.08.15mapper01

    Through Suggestive Matching, I should have an easier time finding matching nodes with similar, but non-exact, naming.  However, per the point of this post, I wasn’t sure if the Mapper just did a simple text comparison or something more sophisticated.

    Simple Name Matching

    In this scenario, we are simply checking to see if the Mapper looks for the same textual value from the source in the destination.  In my source schema I have a field called “ID.”  In my destination schema I have a field called “ItemID.”  As you’d expect, the suggestive match points this relationship out.

    2010.08.15mapper02

    In that case, the name of the source node is a substring of the destination.  What if the destination node is a substring of the source?  To demonstrate that, I have a source field named “PhoneNumber” and the destination node is named “Phone.”  Sure enough, a match is still made.

    2010.08.15mapper03

    Also, it doesn’t matter where in the node name a matching value is found.  If I have a “Code” field in the source tree and both a “ZipCode” and “OrderCodeIdentifier” in the destination, both nodes are considered possible matches.  The word “code” in the latter field, although sandwiched between other text, is still identified as a match.  Not revolutionary of course, but nice.

    2010.08.15mapper04

    Complex Name Matching

    In this scenario, I was looking to see if the Mapper could detect matches based on more than just substrings.  That is, could it figure out that “FirstName” and “FName” are the same?  Unfortunately, the “FirstName” field below resulted in a match to all name fields in the destination.

    2010.08.15mapper05

    The highlighted link is considered the best match, and I noticed that as I added more characters to the “FName” node, I got a different “best match.”

    2010.08.15mapper06

    You see that “FirName” is considered a close match to “FirstName.”  Has anyone else found any cases where similar but inexact wording is still marked as a match?

    Node Positioning

    I was hoping that, via intelligent mapping, an address with a similar structure could be matched across.  That is, if in one map I had identically named nodes before and after a middle node, it might guess that the middle ones matched.  For instance, if I have “City” between “Street” and “State” in the source and “Town” between “Street” and “State” in the destination, maybe it would detect a pattern.  But alas, that is apparently a dream.

    2010.08.15mapper07

    Summary

    It looks like our new intelligent mapper, with the help of Suggestive Match, does a decent job of textual matching between a source and destination schema.  I have yet to see any examples of advanced conditions outside of that.  Still, if all we get is textual matching, that still provides developers a bit of help when traversing monstrous schemas with multiple destination candidates for a source node.

    If you have any additional experiences with this, I’d love to hear about them.


  • My Book is Available for Pre-order on Amazon.com

    Just a quick FYI that my new book, Applied Architecture Patterns on the Microsoft Platform, can now be pre-ordered on Amazon.com.  I’m reviewing final prints now, so hopefully we’ll have this out the door and in your hands shortly.

  • Cloud Provider Request: Notification of Exceeded Cost Threshold

    I wonder if one of the things that keeps some developers from constantly playing with shiny cloud technologies is a nagging concern that they’ll accidentally ring up a life-altering usage bill.  We’ve probably all heard horror stories of someone who accidentally left an Azure web application running for a long time or kept an Amazon AWS EC2 image online for a month and was shocked by the eventual charges.  What do I want? I want a way to define a cost threshold for my cloud usage and have the provider email me as soon as I reach that value.

    Ideally, I’d love a way to set up a complex condition based on various sub-services or types of charges.  For instance: if bandwidth exceeds X, or Azure AppFabric charges exceed Y, then send me an SMS message.  But I’m easy, I’d be thrilled if Microsoft emailed me the minute I spent more than $20 on anything related to Azure.  Can this be that hard?  I would think that cloud providers are constantly accruing my usage (bandwidth, compute cycles, storage) and could use an event-driven architecture to send off events for computation at regular intervals.
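
    To make the idea concrete, here’s a minimal sketch of the sort of rule evaluation I’m imagining on the provider’s side.  To be clear, this is pure invention on my part: every type and name below is hypothetical, since no provider exposes anything like it today.

    using System;
    using System.Collections.Generic;

    //hypothetical sketch only: all of these types are invented for illustration
    public class CostThresholdRule
    {
        public string Service;       //e.g. "Bandwidth" or "AppFabric"; null means "any charge"
        public decimal Threshold;    //dollar amount that should trigger a notification
        public string NotifyAddress; //email address or SMS gateway
    }

    public static class BillingAlerts
    {
        //imagine the provider calling this each time a usage event is accrued,
        //instead of waiting for an end-of-cycle report to run
        public static void OnUsageAccrued(string service, decimal runningTotal,
            IEnumerable<CostThresholdRule> rules)
        {
            foreach (CostThresholdRule rule in rules)
            {
                if ((rule.Service == null || rule.Service == service)
                    && runningTotal >= rule.Threshold)
                {
                    //placeholder for the provider's actual notification channel
                    Console.WriteLine("Notify {0}: {1} charges have reached {2:C}",
                        rule.NotifyAddress, service, runningTotal);
                }
            }
        }
    }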

    If I’m being greedy, I want this for ANY variable-usage bill in my life.  If you got an email during the summer from your electric company that said “Hey Frosty, you might want to turn off the air conditioner since it’s ten days into the billing cycle and you’ve already rung up a bill equal to last month’s total”, wouldn’t you alter your behavior? Why are most providers stuck in a classic BI model (find out things whenever reports are run) vs. a more event-driven model? Surprise bills should be a thing of the past.

    Are you familiar with any providers who let you set charge limits or proactively send notifications?  Let’s make this happen, please.


  • Using “Houston” to Manage SQL Azure Databases

    Up until now, your only option for managing SQL Azure cloud databases was using an on-premises SQL Server Management Studio and pointing it at your cloud database.  The SQL Azure team has released a CTP of “Houston” which is a web-based, Silverlight environment for doing all sorts of stuff with your SQL Azure database.  Instead of just telling you about it, I figured I’d show it.

    First, you need to create a SQL Azure database (assuming that you don’t already have one).  Mine is named SeroterSample.  I’m feeling very inspired this evening.

    2010.07.22SqlAzure01

    Next up, we make sure to have a firewall rule allowing Microsoft services to access the database.

    2010.07.22SqlAzure02

    After this, we want to grab our database connection details via the button at the bottom of the Databases view.

    2010.07.22SqlAzure03

    Now go to the SQL Azure labs site and select the Project Houston CTP 1 tab.

    2010.07.22SqlAzure04

    We then see a futuristic console which either logs me into Project Houston or launches a missile.

    2010.07.22SqlAzure05

    If the login is successful, we get the management dashboard.  It contains basic management operations at the top (“new table”, “new stored procedure”, “open query”, etc), a summary of database schema objects on the left, and an unnecessary but interesting “cube of info” in the middle.

    2010.07.22SqlAzure06

    The section in the middle (aka “cube of info”) rotates as you click the arrows and shows various data points.  Hopefully a future feature includes a jack-in-the-box that pops out of the top.

    I chose to create a new table in my database.  We are shown an interface where we build up our table structure by choosing column names, data types, default values, and more.

    2010.07.22SqlAzure07

    After creating a few columns and renaming my table, I clicked the Save button on the top left of the screen to commit my changes.  I can now see my table in the list of artifacts belonging to my database.

    2010.07.22SqlAzure08

    It’s great to have a table, but let’s put some data into that bad boy.  Clicking the table name re-opens the design view by default.  We can select the Data view at the top to actually add rows to our table.

    2010.07.22SqlAzure10

    I’m not exactly sure how to delete artifacts except through manual queries.  For kicks and giggles I clicked the New View option, and when I canceled out of it, I still ended up with a view in the artifact list.  Right-clicking is not available anywhere in the application, and there was no visible way to delete the view short of creating a new query and deleting it from there.  That said, when I logged out and logged back in, the view was no longer there.  So, because I didn’t explicitly save it, the view was removed when I disconnected.

    All in all, this is a fine, lightweight management interface for our cloud database.  It wasn’t until I was halfway through my demonstration that I realized I had done all my interactions on the portal through a Chrome browser.  Cross-browser stuff is much more standard now, but still nice to see.

    Because I have no confidence that my Azure account is accurately tied to my MSDN Subscription, I predict that this demonstration has cost me roughly $14,000 in Azure data fees.  You all are worth it though.


  • Sending StreamInsight Events to BizTalk Through New Web (SOAP/REST) Adapter

    One StreamInsight usage scenario frequently discussed by the product team involves sending a subset of events (or aggregated complex events) to the Enterprise Service Bus for additional processing and distribution.  As I’ve mentioned before, StreamInsight doesn’t come with any out-of-the-box adapters.  So if you want to make this usage scenario a reality, it’s up to you to figure out how to do it.  In this post, I hope to give you a head start (and code) to making this happen.  I’ve built a StreamInsight web adapter which lets StreamInsight send either SOAP or REST-style messages to an endpoint. We can use this adapter to send messages to BizTalk, or any web endpoint.  Buckle up, this is a long one.

    Designing the Adapter

    In the StreamInsight SDK  you’ll find some solid examples of StreamInsight adapters that you can use as a template to build your own.  I’ve built a few so far myself and demonstrate how to build an MSMQ publication adapter in my new book.  But I hadn’t built a consumer adapter yet, so I had to think about the right design strategy.

    The first design choice was whether to build a typed or untyped adapter.  While typed adapters are easier to craft since you are building to a known data payload, you don’t get any reuse out of the adapter.  So, the first (easy) decision was to build an untyped adapter that could send any payload to a web endpoint.

    The second consideration was how to call the downstream web endpoint.  I decided to use the System.Net.HttpWebRequest object to publish the payload and not try to do an IOC pattern with proxy classes.  By using this mechanism, I can apply the same code to call a SOAP endpoint or invoke various HTTP verbs on a RESTful endpoint.

    Finally, I had to decide how to actually convert the StreamInsight events to the expected XML payload of my web endpoints.  I figured that leveraging XSLT was a solid plan.  I can take the inbound event, and via a runtime configuration property, apply an XML transformation stylesheet to the event and produce output that my web endpoint requires.

    Ok, with all of these considerations in place, let’s build the adapter.  Note that you are completely allowed to disagree with any of the choices above and modify my adapter to fit your needs.

    Building the Adapter

    First off, I built the adapter’s configuration object.  These are the settings that we apply at runtime when we bind a StreamInsight query to an adapter.  Consider this to be reference data that we don’t want to hardcode into our adapter.

    public struct WebOutputConfig
    {
        public string XslPath { get; set; }
        public string ServiceAddress { get; set; }
        public string HttpMethod { get; set; }
        public string SoapAction { get; set; }
        public bool IsSoap { get; set; }
    }
    

    Note that my configuration accepts the path to an XSLT file, the URL of the target service, the HTTP method to apply, and if we are calling a SOAP endpoint, what the SOAP Action value is.

    Next I create my actual adapter class.  It inherits from the untyped PointOutputAdapter class.

    public class WebPointOutput : PointOutputAdapter
    {
        //store references to the CEP event type and the adapter configuration
        private CepEventType bindTimeEventType;
        private string serviceAddress;
        private string httpMethod;
        private string soapAction;
        private bool isSoap;
        private XslCompiledTransform consumerXform;

        public WebPointOutput(WebOutputConfig configInfo, CepEventType eventType)
        {
            this.bindTimeEventType = eventType;
            this.serviceAddress = configInfo.ServiceAddress;
            this.httpMethod = configInfo.HttpMethod;
            this.soapAction = configInfo.SoapAction;
            this.isSoap = configInfo.IsSoap;

            //load up transform
            consumerXform = new XslCompiledTransform(false);
            consumerXform.Load(configInfo.XslPath);
        }
    }
    

    The adapter stores internal references to the configuration values it received and the constructor instantiates the XSL transformation object using the XSL path passed into the adapter.

    Before writing the primary operation which calls the service, we need a helper function which takes the key/value pairs from the CEP event and creates a dictionary out of them.  We will later convert this dictionary into a generic XML structure that we’ll apply our XSLT against.

    private Dictionary<string, string> GetCepEventFields(PointEvent currentEvent)
    {
        Dictionary<string, string> cepFields = new Dictionary<string, string>();

        for (int ordinal = 0; ordinal < bindTimeEventType.FieldsByOrdinal.Count; ordinal++)
        {
            CepEventTypeField evtField = bindTimeEventType.FieldsByOrdinal[ordinal];
            cepFields.Add(evtField.Name, currentEvent.GetField(ordinal).ToString());
        }
        return cepFields;
    }
    

    See above that I loop through all the fields in the event and add each one (name and value) to a dictionary object.

    Now we can build our primary function which takes the StreamInsight event and calls the web endpoint.  After the code snippet, I’ll comment on a few key points.

    private void ConsumeEvents()
    {
        //create new point event
        PointEvent currentEvent = default(PointEvent);
        try
        {
            while (true)
            {
                if (AdapterState.Stopping == AdapterState)
                {
                    Stopped();
                    return;
                }

                if (DequeueOperationResult.Empty == Dequeue(out currentEvent))
                {
                    Ready();
                    return;
                }

                //only publish insert events and ignore CTIs
                if (currentEvent.EventKind == EventKind.Insert)
                {
                    // ** begin service call
                    //convert CEP message to XML for transformation
                    XDocument intermediaryDoc = new XDocument(
                        new XElement("Root",
                            GetCepEventFields(currentEvent).Select(field => new XElement("Property",
                                new XElement("Name", field.Key),
                                new XElement("Value", field.Value)
                                ))));

                    //transform CEP event fields to output format
                    XDocument returnDoc = new XDocument();
                    using (XmlWriter writer = returnDoc.CreateWriter())
                    {
                        consumerXform.Transform(intermediaryDoc.CreateReader(), (XsltArgumentList)null, writer);
                    }

                    //call service
                    HttpWebRequest req = (HttpWebRequest)HttpWebRequest.Create(serviceAddress);
                    req.Method = httpMethod;
                    req.ContentType = "text/xml";
                    if (isSoap)
                        req.Headers.Add("SOAPAction", soapAction);

                    using (Stream reqStream = req.GetRequestStream())
                    {
                        var bytes = Encoding.UTF8.GetBytes(returnDoc.ToString());
                        reqStream.Write(bytes, 0, bytes.Length);
                    }

                    //close the response so the pooled connection is released
                    using (var resp = (HttpWebResponse)req.GetResponse()) { }
                }

                // Every received event needs to be released.
                ReleaseEvent(ref currentEvent);
            }
        }
        catch (AdapterException e)
        {
            System.IO.File.WriteAllText(
                @"C:\temp\" + System.Guid.NewGuid().ToString() + "_eventerror.txt", "Error: " + e.ToString());
        }
    }
    

    First, notice that I do NOT emit CTI events.  Next see that I use a bit of LINQ to take the results of the event-to-dictionary conversion and create an XML document (XDocument) consisting of name/value pairs.  I then take this “intermediary XML” and pass it through an XslCompiledTransform using whichever XSLT was provided during adapter configuration.  The resulting XML is then streamed to the web endpoint via the HttpWebRequest object.  There are probably performance improvements that can be done here, but hey, it’s a proof-of-concept!
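
    To picture what the XSLT has to work against: given the XDocument construction above, an event whose fields are named EvtType, EvtProd, and EvtCount (the fields used later in this post) yields an intermediary document shaped like this (the field values here are just sample data):

    <Root>
      <Property>
        <Name>EvtType</Name>
        <Value>Product Complaint</Value>
      </Property>
      <Property>
        <Name>EvtProd</Name>
        <Value>Seroterum</Value>
      </Property>
      <Property>
        <Name>EvtCount</Name>
        <Value>4</Value>
      </Property>
    </Root>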

    The final piece of this adapter is to fill in the required “start” and “resume” operations.

    public override void Resume()
    {
        new Thread(this.ConsumeEvents).Start();
    }

    public override void Start()
    {
        new Thread(this.ConsumeEvents).Start();
    }

    protected override void Dispose(bool disposing)
    {
        base.Dispose(disposing);
    }
    

    Finally, I have to create an adapter factory which spins up my adapter when the StreamInsight query gets started up.  Since we are using an untyped adapter, there isn’t any logic needed to pick the “right” output adapter.

    public class WebOutputFactory : IOutputAdapterFactory<WebOutputConfig>
    {
        public OutputAdapterBase Create(WebOutputConfig configInfo, EventShape eventShape, CepEventType cepEventType)
        {
            OutputAdapterBase adapter = new WebPointOutput(configInfo, cepEventType);
            return adapter;
        }
    }
    

    With that, we have a complete StreamInsight consumer adapter.

    Using the Adapter

    How can we use this fancy new adapter?  In one scenario, we can use StreamInsight to process a high volume of events, filter out the “noise”, and amplify events of specific interest.  Or, we can empower StreamInsight to look for trends within the stream over a particular time duration and share these complex events whenever one is encountered.

    2010.07.08StreaminsightBts03

    For this post, I’ll show the latter example.  I have a StreamInsight application which generates call center events every half second and sends them to an embedded StreamInsight server.  I do some aggregation over a window of time, and if a complex event is detected, the web adapter is called and BizTalk receives the message for further processing.  Note that nothing prevents me from substituting WCF Services or Azure-based services for BizTalk in this case.  Well, except for security, which I have NOT added to my adapter.  I haven’t figured out a clean way to store and send credentials yet.

    BizTalk Setup

    Let’s set up the BizTalk application that StreamInsight will publish to.  First I created a simple schema that represents the event data I want BizTalk to receive.

    2010.07.08StreaminsightBts02

    In real life I’d add an orchestration or two to process the event data, but this post is already ginormous and you all get the point.  So, let’s jump right to exposing this schema as part of a BizTalk service contract.  I walked through the BizTalk WCF Publishing Wizard and produced a one-way service that takes in my CallThresholdEvent message.

    2010.07.08StreaminsightBts01

    Once the service is created, I built the requisite receive port/location and a send port which subscribes on the CallThresholdEvent message.

    All we need now is the right XSLT to transform the CEP event message to the WCF service contract message format.  How do we get that? The easiest way to get the correct XML is to invoke the service in the WCF Test Client and steal the SOAP payload it builds to call the service.  I pointed the WCF Test Client to my endpoint and invoked the service.

    2010.07.08StreaminsightBts04

    Once I confirmed that the service worked (and emitted a file from the send port), I switched the view from “formatted” to “xml” and could view the XML that was sent across the wire.

    2010.07.08StreaminsightBts05

    I took the “request” XML and created a new XSLT file with this request structure created in the root template.

    <xsl:template match="*">
        <s:Envelope xmlns:s="http://schemas.xmlsoap.org/soap/envelope/">
          <s:Header>
            <!--<Action s:mustUnderstand="1" xmlns="http://schemas.microsoft.com/ws/2005/05/addressing/none">PublishThresholdEvent</Action>-->
          </s:Header>
          <s:Body xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema">
            <CallThresholdEvent xmlns="http://BizTalkEventProcessor">
              <ProductName xmlns="">
                <xsl:value-of select="Property[Name = 'EvtProd']/Value"/>
              </ProductName>
              <CallCategory xmlns="">
                <xsl:value-of select="Property[Name = 'EvtType']/Value"/>
              </CallCategory>
              <OccuranceCount xmlns="">
                <xsl:value-of select="Property[Name = 'EvtCount']/Value"/>
              </OccuranceCount>
              <TimeReceived xmlns=""></TimeReceived>
            </CallThresholdEvent>
          </s:Body>
        </s:Envelope>
      </xsl:template>
    

    Note that you should NOT send the Action header as WCF takes care of that and the service endpoint barfs with an HTTP 500 if you send it.  It also takes roughly 96 hours to figure out that this is the problem.  Consider yourself warned.

    At this point, I have all I need in BizTalk to call the service successfully.

    StreamInsight Setup

    The first query in my StreamInsight application performs an aggregation of events over a “tumbling” window.

    var inputStream = CepStream<CallCenterRequestEventType>.Create("input",
        typeof(CallCenterAdapterFactory), config, EventShape.Point);

    var callTypeCount =
        from w in inputStream
        group w by new { w.RequestType, w.Product } into appGroup
        from x in appGroup.TumblingWindow(
            TimeSpan.FromSeconds(15),
            HoppingWindowOutputPolicy.ClipToWindowEnd)
        select new EventTypeSummary
        {
            EvtType = appGroup.Key.RequestType,
            EvtProd = appGroup.Key.Product,
            EvtCount = x.Count()
        };
    

    In the query above, I take the call center event input stream and put the incoming events into groups based on the event type (e.g. “Info Request”, “Product Complaint”, “Account Change”) and product the customer is calling about.  I base these groups on a tumbling window that lasts 15 seconds.  This means that the window is flushed every 15 seconds and started fresh.  I then take the output of the window grouping and put it into a new, known type named EventTypeSummary.  If I use an anonymous type here instead, I get a “System.IndexOutOfRangeException: Index was outside the bounds of the array” error.
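
    I haven’t shown the EventTypeSummary class itself; it’s nothing more than a plain class with public members matching the projection above.  A minimal sketch (the exact member types are my assumption) looks like this:

    public class EventTypeSummary
    {
        public string EvtType { get; set; }  //call category, e.g. "Product Complaint"
        public string EvtProd { get; set; }  //product the customer is calling about
        public long EvtCount { get; set; }   //match this type to what Count() returns in your version
    }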

    I next take the result of the first query and make it the input to a second query.  This one looks at any groups emitted by the first query and filters them based on criteria my ESB is interested in.

    var callTypeThreshold =
                from summary in callTypeCount
                where summary.EvtCount > 3 && summary.EvtType == "Product Complaint"
                select summary;
    

    Above, I am looking for any “summary events” where the call type is a product complaint and there have been more than 3 of them for a specific product (during a given window).

    Before I register my query, I need to define the StreamInsight adapter configuration for my web endpoint.  Recall above that we defined a structure to hold parameters that we will pass into the adapter at runtime.

    var webAdapterBizTalkConfig = new WebOutputConfig()
    {
        HttpMethod = "POST",
        IsSoap = true,
        ServiceAddress = "http://localhost/BizTalkEventProcessingService/BizTalkEventProcessingService.svc",
        SoapAction = "PublishThresholdEvent",
        XslPath = @"[path]\CallCenterEvent_To_BizTalkSoapService.xslt"
    };
    

    Above, you’ll see the service address pointing to my BizTalk-generated WCF endpoint, the SOAP action for my service, and a pointer to the XSLT that I created to transform the CEP event to a SOAP payload.

    Finally, I register the query and start it.

    var allQuery = callTypeThreshold.ToQuery(
                             myApp,
                             "Threshold Events",
                             string.Empty,
                             typeof(WebOutputFactory),
                             webAdapterBizTalkConfig,
                             EventShape.Point,
                             StreamEventOrder.FullyOrdered);
    

    You can see that I pass in my web adapter factory type and the adapter configuration properties defined earlier.

    The Result

    When all this is in place, I start up my StreamInsight application, begin generating events, and can observe BizTalk messages getting written to disk.

    2010.07.08StreaminsightBts06

    In this post we saw how I can link StreamInsight with BizTalk Server through a WCF channel.  You can grab the source code for the StreamInsight Web Adapter here. I’ve done some basic testing of the adapter against both RESTful and SOAP services, but there are great odds that you’ll find something I missed.  However, it hopefully gives you a great head start when building a StreamInsight solution that emits events to web endpoints.


  • Updated Ways to Store Data in BizTalk SSO Store

    One of my more popular tools has been the BizTalk SSO Configuration Data Storage Tool.  At the time I built that, there was no easy way to store and manage Single Sign On (SSO) applications that were used purely for secure key/value pair persistence.
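
    As a refresher, pulling a stored value back out at runtime goes through the SSO client API.  Below is a minimal sketch of that pattern; it assumes a reference to Microsoft.EnterpriseSingleSignOn.Interop and uses the “ConfigProperties” identifier that these config-storage tools conventionally use, so treat it as illustrative rather than gospel.

    using System.Collections.Specialized;
    using Microsoft.EnterpriseSingleSignOn.Interop;

    //simple property bag that SSO fills in with the requested values
    public class ConfigPropertyBag : IPropertyBag
    {
        private HybridDictionary properties = new HybridDictionary();

        public void Read(string propName, out object ptrVar, int errorLog)
        {
            ptrVar = properties[propName];
        }

        public void Write(string propName, ref object ptrVar)
        {
            properties[propName] = ptrVar;
        }
    }

    public static class SSOConfigReader
    {
        public static string Read(string appName, string propName)
        {
            //prime the bag with the property we want, then let SSO overwrite the value
            ConfigPropertyBag bag = new ConfigPropertyBag();
            object val = string.Empty;
            bag.Write(propName, ref val);

            ISSOConfigStore configStore = new ISSOConfigStore();
            configStore.GetConfigInfo(appName, "ConfigProperties", SSOFlag.SSO_FLAG_RUNTIME, bag);

            object result;
            bag.Read(propName, out result, 0);
            return (string)result;
        }
    }

    So something like SSOConfigReader.Read("SeroterToolApp", "Key1") — key name invented — would hand back the stored value.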

    Since that time, a few folks (that I know of) have taken my tool and made it better.  You’ll find improvements from Paul Petrov here (with update mentioned here), and most recently by Mark Burch at BizTorque.net.  Mark mentioned in his post that Microsoft had stealthily released a tool that also served the purpose of managing SSO key/values, so I thought I’d give the Microsoft tool a quick whirl.

    First off, I downloaded my own SSO tool, which I admittedly haven’t had a need to use for quite some time.  I was thrilled that it worked fine on my new BizTalk 2010 machine.

    2010.07.05sso01

    I created (see above) a new SSO application named SeroterToolApp which holds two values.  I then installed the fancy new Microsoft tool which shows up in the Start Menu under SSO Application Configuration.

    2010.07.05sso02

    When you open the tool, you’ll find a very simple MMC view that has Private SSO Application Configuration as the root in the tree.  Somewhat surprisingly, this tool does NOT show the SSO application I just created above with my own tool.  Microsoft elitists think my application isn’t good enough for them.

    2010.07.05sso03

    So let’s create an application here and see if my tool sees it.  I right-click that root node in the tree and choose to add an application.  You see that I also get an option to import an application, and choosing this prompts me for a “*.sso” file saved on disk.

    2010.07.05sso04

    After adding a new application, I right-clicked the application and chose to rename it.

    2010.07.05sso05

    After renaming it MicrosoftToolApp, I once again right-clicked the application and added a key value pair.  It’s nice that I can create the key and set its value at the same time.

    2010.07.05sso06

    I added one more key/value pair to the application.  Then, when you click the application name in the MMC console, you see all the key/value pairs contained in the application.

    2010.07.05sso07

    Now we saw earlier that the application created within my tool does NOT show up in this Microsoft tool, but what about the other way around?  If I try to retrieve the application created in the Microsoft tool, sure enough, it appears.

    2010.07.05sso08

    For bonus points, I tried to change the value of one of the keys from my tool, and that change is indeed reflected in the Microsoft tool.

    2010.07.05sso09

    2010.07.05sso10

    So this clearly shows that I am a much better developer than anyone at Microsoft.  Or more likely, it shows that somehow the applications that my tool creates are simply invisible to Microsoft products.  If anyone gets curious and wants to dig around, I’d be somewhat interested in knowing why this is the case.

    It’s probably a safe bet moving forward to use the Microsoft tool to securely store key/value pairs in Enterprise Single Sign On.  That said, if using my tool continues to bring joy into your life, then by all means, keep using it!


  • Interview Series: Four Questions With … Saravana Kumar

    Happy July and welcome to the 22nd interview with a connected technology thought leader.  Today we’re talking to Saravana Kumar who is an independent consultant, BizTalk MVP, blogger, and curator of the handy BizTalk 24×7 and BizTalk BlogDoc communities.  The UK seems to be a hotbed for my interview targets, and I should diversify more, but they are just so damn cheery.

    On with the interview! 

    Q: Each project requires the delivery team to make countless decisions with regards to the design, construction and deployment of the solution. However, there are typically a handful of critical decisions that shape the entire solution. Tell us a few of the most important decisions that you make on a BizTalk project.

    A: Every project is different, but there is one thing common across all of them: having a good support model after it’s live.  I’ve seen projects on numerous occasions miss out on the requirements gathering needed to put a solid application support model in place.  One of the key decisions I’ve made on the project I’m on is to use BizTalk’s Business Activity Monitoring (BAM) capabilities to build a solid production support model with the help of Microsoft Silverlight.  I’ve briefly hinted about this here in my blog.  There is a wide misconception that BAM is used only to capture key business metrics, but the reality is that it’s just a platform capable of capturing key data in a high-volume system in an efficient way.  The data could be purely technical monitoring stuff, not necessarily business metrics.  Now we get end-to-end visibility across various layers, and a typical problem analysis takes minutes, not hours.

    Another important decision I make on a typical BizTalk project is to think about performance in the very early stages.  Typically you need to get the non-functional SLA requirements way upfront, because they will affect some of the key decisions; a classic one is whether to use orchestrations or design the solution using a purely messaging-based pattern.

    There are various other areas I’d be interested to write about here, like DR, consistent build/deployment across multiple environments, consistent development solution structure, schema design, etc.  But in the interest of space I’ll move on to the next question!

    Q: There are so many channels for discovering and learning new things about technology. What are your day-to-day means for keeping up to date, and where do you go to actually invest significant time in technology?

    A: For the past few years (5-6 years) the discovery part for me has always been blogs.  You get the lead from there, and if something interests you, you build up the links by doing further searching on the topic.  I can cite a recent experience: learning about FMSB (Financial Messaging Service Bus), which is something built on top of the BizTalk ESB Toolkit for the vertical financial services market.  I came to know about it from a blog post, whose author learned about it while chatting with someone at the BizTalk booth during TechEd.

    A: When it comes to the learning part, my first preference these days is videos.  We are living in the age of information overload, and the biggest challenge is finding the right material.  These days video material gets to the public domain almost instantaneously.  So, for example, if I’m not going to PDC or TechEd, I normally schedule the whole thing as if I’m attending the conference and go through the videos over the next 3-4 weeks.  This way I don’t miss out on any big news.

    Q: As a consultant, how do you decide to recommend that a client uses a beta product like BizTalk Server 2010 or completely new product like Windows Azure Platform AppFabric? Do you find that you are generally more conservative or adventurous in your recommendations?

    A: I work mainly with financial services clients, where projects and future directions are driven by the business and not by technology.  So, unless there is a really pressing need from the business, it will be difficult to recommend a cutting-edge technology.  I also strongly believe that technology is there to support the business and not vice versa.  That doesn’t mean our applications are still running on Excel macros and 90’s-style VB 4.0 applications.  Our state-of-the-art BPM platform, which gives the business straight-through processing (STP) of paper applications right from opening the envelope to committing the deal in our AS/400 systems, is built using BizTalk Server 2006.  We started this project just after BizTalk Server 2006 was released (not the beta, but just after it RTM’ed).  To answer your question, if there is real value for the business in an upcoming beta product, I’ll be heading in that direction.  Whether I’m conservative or adventurous will depend on the stakes.  For BizTalk Server 2010 I’ll be a bit adventurous to get some cheap wins (just the platform upgrade is going to give us a certain percentage of performance gain with minimal or no risk), but for a technology like Azure, either on-premises or in the cloud, I’ll be a bit conservative and wait for both the right business need and the maturity of the technology itself.

    Q [stupid question]: It’s summertime, so that means long vacations and the occasional “sick day” to enjoy the sunshine. Just calling the office and saying “I have a cold” is unoriginal and suspicious. No, you need to really jazz it up to make sure that it sounds legitimate and maybe even a bit awkward or uncomfortable. For instance, you could say “I’m physically incapable of wearing pants today” or “I cut myself while shaving … my back.” Give us a decent excuse to skip work and enjoy a summer day.

    A: As a consultant, I don’t get paid if I take a day off sick.  But that doesn’t stop me from thinking about a crazy idea.  How about this: I ate something very late last night at the local kebab shop, and since then I’ve been constantly burping every 5 minutes non-stop with a disgusting smell. 🙂

    Thanks Saravana, and everyone enjoy their summer vacations!


  • Leveraging and Managing the StreamInsight Standalone Host

    In my recent post that addressed the key things that you should know about Microsoft StreamInsight, I mentioned the multiple hosting options that are at your disposal.  Most StreamInsight examples (and documentation) that you find demonstrate the “embedded” server option where the custom application that you build hosts the StreamInsight engine in-process.  In this post, I’m going to dig into how you take advantage of the out-of-process standalone server for StreamInsight.  I’m also going to give you a little application I created that fills the gaps in the visual tooling for StreamInsight.

    If you chose to leverage the embedded server model, your code would probably start off something like this:

    //create embedded server
    using (Server server = Server.Create("RSEROTER"))
    {
        //create application in the embedded server
        var myApp = server.CreateApplication("SampleEvents");

        // .. create query, start query
    }
    

    This type of solution is perfectly acceptable and provides the developer with plenty of control over the way the queries are managed.  However, you don’t get the high availability and reuse that the standalone server offers.

    Creating the Host

    So how do we use the remote, standalone host?  When you install StreamInsight, you are given the option to create a server host instance.

    2010.06.27si05

    Above, you can see that I created an instance named RSEROTER.  When the installation is completed, a folder is created in the StreamInsight directory.

    2010.06.27si01

    A Windows Service is also created for this instance, and it uses a configuration file from the folder created above.

    2010.06.27si02

    Configuring the Host

    To be able to start this Windows Service, you’ll have to make sure that the endpoint address referenced in the service’s configuration file matches a registered endpoint for the server.  The configuration file for this StreamInsight host looks like this:

    2010.06.27si03

    The endpoint address for the StreamInsight Management Service needs to be one of the addresses in my server’s reserved list.  Go to a command prompt and type netsh http show urlacl to see reserved endpoints and associated accounts.  Mine looks like this:

    2010.06.27si04

    If your addresses and permissions line up, your service will start just fine. If your StreamInsight Windows Service uses a logon account that doesn’t have rights to the reserved endpoint, then the Windows Service won’t start. If the values in the configuration file and the registered endpoint list differ, the service won’t start. If you plan on using both an embedded and standalone server model concurrently, you will want to register a different URL and port for the embedded endpoints.

    In my case, I changed the user account associated with my registered endpoint so that the StreamInsight Windows Service could open the endpoint. First I deleted the existing registered entry by using netsh http delete urlacl url=http://localhost:80/StreamInsight/RSEROTER/ and then added a new entry back with the right account (Network Service in my case) via netsh http add urlacl url=http://localhost:80/StreamInsight/RSEROTER user="Network Service". The StreamInsight installation guide has more details on setting up the right user accounts to prevent “access is denied” errors when connecting the debugger or trying to create/read server applications.

    Considerations for Standalone Host Model

    Now that you have a StreamInsight server instance started up, what should you know? Unlike the “embedded” StreamInsight hosting model where your application starts up and runs the StreamInsight engine in process, the standalone model uses a remote connection-based strategy.  The other thing to remember is that because you are using an out-of-process service, you also have to strong-name and GAC the assemblies containing your event payload definitions and adapters. Note that if you forget to start the Windows Service, you’ll get a warning that the WCF endpoint is in a faulted state.  Finally, be aware that you can only explicitly create a management endpoint in code if you have an embedded server.
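
    As an example of that strong-naming and GAC step: generate a key, sign the adapter and payload projects with it in Visual Studio, and install the built assemblies into the GAC.  The file names below are from my sample solution, so substitute your own:

    sn -k WebAdapterKey.snk
    gacutil /i SeroterWebAdapters.dll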

    Before I show you how to deploy queries to this standalone host, I should tell you about the management activities you CANNOT do via the only graphical tool that StreamInsight provides, the StreamInsight Event Flow Debugger.  The Debugger allows you to view existing applications, show queries included in applications, and both start and stop queries.  What you CANNOT do graphically is create applications, delete applications and delete queries.  So, I’ve built a tool that lets you do this.

    The New StreamInsight Server Manager

    Prior to writing code that connects to the StreamInsight server and deploys queries, I want to create the application container on the server.  I open up my StreamInsight Server Manager, connect to my endpoint (value read from my application’s configuration file) and choose to Create a new server application.

    2010.06.27si06

    Once you have an application, you can right-click it and choose to either Delete the application or view any queries associated with it.

    2010.06.27si07

    Coding to and Using the Standalone Server Instance

    Let’s write some code!  I’ve built a console application that creates or starts a StreamInsight query.  First off, I use a “connect” operation to link to my standalone server host.

    //connect to standalone server
    using (Server server = Server.Connect(new System.ServiceModel.EndpointAddress(@"http://localhost/StreamInsight/RSEROTER")))
    {
        //work with applications and queries here (see below)
    }
    

    I then find the application that I created earlier.

    Application myApp;
    //get reference to existing application
    myApp = server.Applications["CallCenterEvents"];
    

    If my query is already on the server, then this application will just start it up.  Note that I could have also used my StreamInsight Server Manager or the Event Flow Debugger to simply start a server query.  I don’t need a custom application for that if I have a standalone server model.  But, this is what starting the query in code looks like:

    //if query already exists, just start it
    if (myApp.Queries.ContainsKey("All Events"))
    {
        Query eventQuery = myApp.Queries["All Events"];
        eventQuery.Start();

        //wait for keystroke to end
        Console.ReadLine();

        eventQuery.Stop();
    }
    

    If my query does NOT exist, then I create the query and start it up, along the lines of the sketch below.
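
    This is essentially the same ToQuery registration shown in my web adapter post, just executed against the connected standalone server.  A condensed sketch (I’m reusing the web adapter factory and configuration from that post purely as an example output target; your adapter factories and configuration objects will differ):

    //define the stream and query, then deploy it to the standalone server
    var inputStream = CepStream<CallCenterRequestEventType>.Create("input",
        typeof(CallCenterAdapterFactory), config, EventShape.Point);

    var allEvents = from e in inputStream select e;

    Query eventQuery = allEvents.ToQuery(
        myApp,                      //the "CallCenterEvents" application from above
        "All Events",
        string.Empty,
        typeof(WebOutputFactory),   //output adapter factory and its config
        webAdapterConfig,
        EventShape.Point,
        StreamEventOrder.FullyOrdered);

    eventQuery.Start();

    When I start my custom application, I can see from the StreamInsight Event Flow Debugger that my query is running.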

    2010.06.27si08

    If I flip to my StreamInsight Server Manager application, I can also see the query (and its status).

    2010.06.27si09

    Unlike the Event Flow Debugger, this application also lets you delete queries.

    2010.06.27si10

    Because I’m using the standalone server host option, I could choose to stop my custom application and my query is still available on the server.  I can now start and stop this query using the Event Flow Debugger or my StreamInsight Server Manager.

    2010.06.27si11

    Summary

    I expect that we’ll soon see more from Microsoft on building highly available StreamInsight solutions by using the standalone instance model.  This model is a great way to get reuse out of adapters and queries and get metadata durability in a central server host.  When using the standalone instance model you just have to remember the few things I pointed out above (e.g. using the GAC, getting the management endpoint set up right).

    You can grab the executable and source code for the StreamInsight Server Manager here.  As you can expect from me in these situations, this is hardly production code.  But, it works fairly well and solves a problem.  It also may prove a decent example of how to access and loop through StreamInsight applications and queries.  Enjoy.


  • 6 Things to Know About Microsoft StreamInsight

    Microsoft StreamInsight is a new product included with SQL Server 2008 R2.  It is Microsoft’s first foray into the event stream processing and complex event processing market, which already has its share of mature products and thought leaders.  I’ve spent a reasonable amount of time with the product over the past 8 months and thought I’d try to give you a quick look at the things you should know about it.

    1. Event processing is about continuous intelligence.  An event can be all sorts of things ranging from a customer’s change of address to a meter read on an electrical meter.  When you have an event-driven architecture, you’re dealing with asynchronous communication of data as it happens to consumers who can choose how to act upon it.  The term “complex event processing” refers to gathering knowledge from multiple (simple) business events into smaller sets of summary events.  I can join data from multiple streams and detect event patterns that may not have been visible without the collective intelligence. Unlike traditional database-driven applications where you constantly submit queries against a standing set of data, an event processing solution deploys a set of compiled queries that the event data passes through.  This is a paradigm shift for many, and can be tricky to get your head around, but it’s a compelling way to complement an enterprise business intelligence strategy and improve the availability of information to those who need it.
    2. Queries are written using LINQ.  The StreamInsight team chose LINQ as their mechanism for authoring declarative queries.  As you would hope, you can write a fairly wide set of queries that filter content, join distinct streams, perform calculations and much more.  What if I wanted to have my customer call center send out a quick event whenever a particular product was named in a customer complaint?  My query can filter out all the other products that get mentioned and amplify events about the target product:
      var filterQuery =
            from e in callCenterInputStream
            where e.Product == "Seroterum"
            select e;
      

      One huge aspect of StreamInsight queries relates to aggregation.  Individual event calculation and filtering is cool, but what if we want to know what is happening over a period of time?  This is where windows come into play.  If I want to perform a count, average, or summation of events, I need to specify a particular time window that I’m interested in.  For instance, let’s say that I wanted to know the most popular pages on a website over the past fifteen minutes, and wanted to recalculate that total every minute.  So every minute, calculate the count of hits per page over the past fifteen minutes.  This is called a Hopping Window. 

      var activeSessions = from w in websiteInputStream
                           group w by w.PageName into pageGroup
                           from x in pageGroup.HoppingWindow(
                               TimeSpan.FromMinutes(15),
                               TimeSpan.FromMinutes(1),
                               HoppingWindowOutputPolicy.ClipToWindowEnd)
                           select new PageSummary
                           {
                               PageName = pageGroup.Key,
                               TotalRequests = x.Count()
                           };
      

      I’ll have more on this topic in a subsequent blog post but for now, know that there are additional windows available in StreamInsight and I HIGHLY recommend reading this great new paper on the topic from the StreamInsight team.

    3. Queries can be reused and chained.  A very nice aspect of an event processing solution is the ability to link together queries.  Consider a scenario where the first query takes thousands of events per second, filters out the noise, and leaves me only with a subset of events that I care about.  I can use the output of that query in another query which performs additional calculations or aggregation against this more targeted event stream.  Or, consider a “pub/sub” scenario where I receive a stream of events from one source but have multiple output targets.  I can take the results from one stream and leverage them in many others.
    4. StreamInsight uses an adapter model for the input and output of data.  When you build up a StreamInsight solution, you end up creating or leveraging adapters.  The product doesn’t come with any production-level adapters yet, but fortunately there are a decent number of best-practice samples available.  In my upcoming book I show you how to build an MSMQ adapter which takes data from a queue and feeds it into the StreamInsight engine.  Adapters can be written in a generic, untyped fashion and therefore support easy reuse, or, they can be written to expect a particular event payload.  As you’d expect, it’s easier to write a specific adapter, but there are obviously long term benefits to building reusable, generic adapters.
    5. There are multiple hosting options.  If you choose, you can create an in-process StreamInsight server which hosts queries and uses adapters to connect to data publishers and consumers.  This is probably the easiest option to build, and you get the most control over the engine.  There is also an option to use a central StreamInsight server which installs as a Windows Service on a machine.  Whereas the first option leverages a “Server.Create()” operation, the latter option uses a “Server.Connect()” operation for working with the engine.  I’m writing a follow-up post shortly on how to leverage the remote server option, so stay tuned.  For now, just know that you have choices for hosting.
    6. Debugging in StreamInsight is good, but overall administration is immature.   The product ships with a fairly interesting debugging tool which also acts as the only graphical UI for doing rudimentary management of a server.  For instance, when you connect to a server (in process or hosted) you can see the “applications” and queries you’ve deployed.
      2010.6.22si01
      When a query is running, you can choose to record the activities, and then play back the stream.  This is great for seeing how your query was processed across the various LINQ operations (e.g. joins, counts). 
      2010.6.22si02
      Also baked into the Debugger are some nice root cause analysis capabilities and tracing of an event through the query steps.  You also get a fair amount of server-wide diagnostics about the engine and queries.  However, there are no other graphical tools for administering the server.  You’ll find yourself writing code or using PowerShell to perform other administrative tasks.  I expect this to be an area where you see a mix of community tools and product group samples fill the void until future releases produce a more robust administration interface.

    That’s StreamInsight in a nutshell.  If you want to learn more, I’ve written a chapter about StreamInsight in my upcoming book, and also maintain a StreamInsight Resources page on the book’s website.