Archive for the ‘Coding’ Category

Creating an Exchange Feed Recorder (cont.)

Tuesday, June 14th, 2011

A few days ago I started creating an exchange feed recorder, and today (finally), I got the time to finish things off. The outstanding issues were that I hadn't tested anything, and that I hadn't really worked out the start/stop scripts, etc. So today it was a simple matter of testing the code, fixing a few issues, and then setting up the infrastructure so it'd be easy to start/stop the recorder for daily use.

It wasn't all that hard, but it took an hour or so to get it all nailed down. I then spent a little time writing a reader app to allow people to see how you would read from the file, find the datagram, and then process it. Not bad, but it needed to be done in order to show that the recording process wasn't corrupting the data.

Nice to finally get this all nailed down.

Helping Folks Avert Disaster – It Could be a Full-Time Job

Tuesday, June 14th, 2011

This morning I had a nice, long, talk with one of the developers in the shop about a project he was going to be working on. The problems I saw in his design were significant. The application was non-trivial, I grant you, but that should have been an indicator to him that the simple solution of a few structs in a trie wouldn't really be sufficient. Because of the way the feed is coming from the exchange, we really need a far more complex data structure. It's going to make things much faster, but it's going to take a significant amount of time to build.

He was going to build a train wreck, and I hope we averted it. We'll see. There's a lot that needs to be built. I'm sure this isn't the last time we'll talk about it.

Problems are Solved by People that Show Up

Tuesday, June 14th, 2011

This morning I found myself sitting here with nothing at all to do because one of the systems I depend on wasn't up, and there was no one here who knew how to get it up. That's a common problem for me, and in the past, what I've done is to learn how to get these things up, and then get them up myself. It always reminds me of the line on West Wing I saw so many years ago: People... Problems are solved by people that show up.

I could not agree more. Show up. Solve problems. After the day I wasted yesterday on this horrible double-bug, I'm not one to say bad things shouldn't happen - they do. Period. But it's the showing up that's really the key. If there's a problem, then by showing up, you can help solve them.

And showing up after it's solved is like not showing up at all.

When Two Wrongs Make a Right – Finding a Nasty Bug

Monday, June 13th, 2011

bug.gif

Today I've spent all day tracking down the most devilish bug in my code - yup, right there in my code. And the reason I didn't see it right away is that this code hasn't changed in several days, and it's been working perfectly for quite a while. But the trick was that it unknowingly depended on another bug, that was fixed yesterday evening, and because it was fixed, my bug became a real bug. But figuring this out was a painful, and laborious, task.

The set-up for the way it used to 'work' was that I had two services, on two boxes, and each service was hosted by a Broker:

The old, broken way

The client would randomly contact one of the locator services - most likely, it would be the one on the same box as he was running - but there was no guarantee to this. But for the sake of example, let's say it hits the red box. The locator service is asked "can you handle this symbol?" and if it can, it responds to the client immediately. If it can't, then it asks the Broker to list all services that start with the same name, and then proceeds to ask each if they can handle the symbol.

The red locator hits the blue locator, and since it's got to be one or the other, it answers pretty quickly. So where's the bug? Well... the first one is that if there are two services with the same name, we should 'prefer' the service on the same host as the client. This minimizes the network traffic and keeps things "local" as much as possible. You can see it coming, can't you?

With the preference set to local services, the red locator will ask the Broker for all similarly named services and get - you guessed it: itself! This places it in an infinite loop - but with boost asio, there's only one thread to process things, and that one thread can't receive and send at the same time, so we lock up.

Just One Error

So the fix was simple - don't ask for all similarly named services - make sure you exclude yourself! With this simple one-line fix to the code, everything worked again. It was just a complete day trying to figure out where the problem existed in order to figure out that one line that would do the trick. Ick.

Creating an Exchange Feed Recorder

Friday, June 10th, 2011

This afternoon, while I was watching my Greek Engine just hum along as pretty as you please, I decided it was time to get busy on some of the lesser projects that needed to be done. One of the first was an exchange feed recorder. Basically, we needed to just take all the UDP datagrams, tag them with the time they arrived, and write it all out in some manner that makes it not too horribly difficult to read and subsequently process.

Interestingly enough, this wasn't that hard. The hardest part was doing the writing of the file. The basics were already there for me in what I'd done already, and I just had to spend a few hours to get everything cleaned up and ready to go.

Now all I need to do is test it. Pretty nice.

Forked Gist Vim Plugin

Friday, June 10th, 2011

GitHub Source Hosting

This morning I was looking to see if anyone had updated the Gist Vim plugin to support the other functions that I haven't yet gotten time to check on. I found the someone had put it up on GitHub, and checked to see if their repo had the fix I made to the plugin yesterday. He hadn't. But that got me to thinking - Why don't you fork it, and fix it yourself? So I did.

I've now got a repo at GitHub for my fork of the Gist Vim plugin that does a few things that the original didn't:

  • Pulling a Gist doesn't split the window - it takes the whole buffer
  • Pulling a Gist works with the new API v3 GitHub spec

I pushed it up - worked like a charm, and now I've got a place to keep this bad boy up to date. I still haven't had time to look at the other functions, but I will, and I'll fix them, if needed. The scripting language is pretty nice, and I can't imagine I'd need anything it doesn't support.

It's silly, but this is my first fork on GitHub, and I'm really pretty giddy about it. Jazzed, even.

Spinning Up a Complete Greek Engine Configuration

Thursday, June 9th, 2011

High-Tech Greek Engine

This morning I have a pretty stable Greek Engine going, so it's time to try and set up the second machine to handle the second-half of the market. After all, we need to cover all of it, and two boxes is pretty cheap in comparison to the existing hardware footprint.

Thankfully, I've recently received the machines that were ordered for just this purpose. SO I had to check on them - make sure they had the iODBC configuration set up, the FreeTDS config - all the things my processes are going to need. Didn't take long, but it was also the right time to get my home directory set up with the SSH keys, etc. Just to make things clean on the boxes.

After that, it was time to get into the configuration of the start-up scripts in the code and the crontab on the new box. It's not all that hard - by design, thank you, but it's something that I really needed to do.

In the end, I have the entire market on two boxes. Not bad, considering what it's replacing.

Minor Update to Vim Gist Plugin for New GitHub API

Thursday, June 9th, 2011

MacVim.jpg

Today I was trying to use the Gist Vim plugin and noticed that it was a bit broken. In fact, it wasn't even close to working. I was getting HTML redirection messages, and that's not good. So I decided to take a few minutes and see if I could get it back working again. I looked at the site and noticed that it hadn't been updated recently, so it was going to be up to me.

What I figured was that it was in the URL for getting the Gist from GitHub. That's in this code:

  1. function! s:GistGet(user, token, gistid, clipboard)
  2. let url = 'https://gist.github.com/'.a:gistid.'.txt'
  3. let winnum = bufwinnr(bufnr('gist:'.a:gistid))

and after a bit of time reading the GitHub API site, and fiddling around with the URL, I was able to see that the correct URL is really:

  1. function! s:GistGet(user, token, gistid, clipboard)
  2. let url = 'https://raw.github.com/gist/'.a:gistid
  3. let winnum = bufwinnr(bufnr('gist:'.a:gistid))

So I changed that in my copies, and I'm good to go. For a while.

Better Handling of Misbehaving Clients

Thursday, June 9th, 2011

High-Tech Greek Engine

Today I did a lot more interaction with the UI developer that's putting a beautiful web interface on my Greek Engine. Really, he's using the Greek Engine as a support service to a project that the business is really hot to get into production. It's nice, and pulling from a lot of services registered in The Broker. It's just that he was starting to try and hit my service for data, and not getting anything, and I was not really logging what was happening.

What was happening was he was misspelling the command component, and I was doing exactly what I'd told the system to do - which is nothing, and then return nothing. Clearly, I needed to handle mis-behaving clients a lot better.

What I did was to simply add in a check to see if the client's request was going to cause me to do anything. If not, then I logged it, and sent him back an error. That should make it a lot clearer what is going on for malformed requests. With the logging, this allows me to know what's up with the clients as well. Much better.

Adding Self-Discovery to Client Code

Wednesday, June 8th, 2011

High-Tech Greek Engine

As I'm getting ready to move into the "full-up" mode of my Greeks Service, I realize that my client library really needs to auto-discover the servers it can talk to, and certainly, the non-C++ clients need to have an easy way to find out what's hosted where. Given that I really don't want the clients to have to hit something like a mongoDB database and parse the document to get the range, it made a lot more sense to build the discovery into the services themselves. After all, they are going to have to read their configuration from the mongoDB-backed configuration service in The Broker, so it makes sense that if they are already up and going, it should be very easy to have them respond to simple requests for what they cover.

So the first thing to do was to add a "coverage" request/response to the application. This would respond with a list of two strings - the beginning of the covered range, and the end of the covered range. These are obtained from the OPRA channels that we're getting data from, and it's also the filtering criteria on what options to load in from the data master database of instrument data.

This was really simple because there's no need for any security on this call, and it's the only 'call' (aka 'one shot') that the service handles. Pretty simple. With this, the C++ client can then ask The Broker what services it knows about, filtering out based on a very simple pattern match to find the ones that are for my Greek Engine, and then ask each for their coverage. It then builds up the map and it's then easy for the client to know who to dispatch the request to. Nice.

But that's not the end of the story. I also wanted to make it possible to have a client ask a service where to go. This is a very simple service that does basically the same thing, but if we have one per Greek Engine service, then The Broker can load-balance between them, and each will know it's own engine's coverage quickly and easily, and it can repeat the process the client code uses to find out all the others. Once it's got this data cached, it's fast.

The upside is that we have a simple "piggy back" service on each Greek Engine that can locate any symbol or family very quickly. This means that if a web sockets client needs to know who to talk to, they can his this locator service, ask it, get the service name, and then talk to it. Pretty nice.

With this, I've got a pretty nice self-discovery system in place. I could add it to my Ticker Plants too, but I'm not so sure they will benefit as much from it. They broadcast on ZeroMQ reliable multicast channels, and the client is the only way to get to that data. So it's not as big a deal. But it's a nice solution, and I can think about it for later.