Archive for the ‘Coding’ Category

Smartest Way to Speed Up: Just Do Less

Monday, February 6th, 2012

High-Tech Greek Engine

Today I spent the vast majority of my day today trying to make this one client application of my greek engine a lot faster. I mean a lot faster. Friday afternoon, I was running some tests on this usage pattern, and realized that the client really was seeing some massive delays in getting data from my engine when dealing with very large, very active families. Using SPY as the example, there are some 2500 derivatives on SPY, and calculating their data and returning it to the caller was taking from 1800 to 2200 msec. That's a long time. The problem was magnified because all they wanted was three of the 2500 options, and they had to wait for all 2500.

Not good.

So Friday I jotted down a few ideas to try today and spent the first few hours doing just that. Each one was a little better, but I was still looking at 1300 msec, and that's just too long. I needed to chop out an order of magnitude or two. So I started doing the profiling. What was it that was taking so long?

Well… it's the calculations. That's no surprise, but it's a real bottleneck too. We can't really afford to make the calculations tie up multiple threads. That'd kill the box with some 50 clients each needing multiple threads for their calks. Not good. I tried to look at other things, but in the end, it always came back to the calculations.

Along the way, however, I did come up with a few really fun optimizations. I was able to look at a continually updating profile of the instrument and use those values to 'seed' the request, but the updates from the market were just so frequent, it was impossible to stay ahead of the updates. It was a real problem.

So I did what I should have done first - go and talk to the coders writing the client app.

I found out that all they really wanted were the implied vols and they only wanted two or three options in each call. Well… now that's very interesting. That's a use-case that I hadn't expected. The reason it's very interesting is that the implied vols can be calculated independently of each other, which means that by telling me you're interested in only the implied vol calculations, I can look at the three options you're asking for, and calculate just them. Sweet.

I had to work into the API the idea of the type of calculation, but we had something pretty much like that already in the API - it just needed a simple extension. And then I had to get the different type handled in the code. In the end, it wasn't too bad, and the time savings were amazing!

The 1800 msec went to 20 msec. That's something that's more than fast enough for what we need. All because I listened to what the client specifically needed. Simple way to be faster? Just do less.

Excellent.

Updated Git to 1.7.8.4 on My MacBook Pro

Monday, February 6th, 2012

gitLogo.gif

This morning I thought that git on my MacBook Pro might be a little behind the times. I don't honestly think there's a huge difference from 1.7.4.1 to 1.7.8.4, but you never know, and it's simple using the Mac OS X installer. Just download it, double-click, and it's ready to go.

It's nice to see:

  $ git --version
  git version 1.7.8.4

Nice. Love it when things "just work".

Interesting… I just noticed that Mac OS X 10.7.3 comes with git - and it makes perfect sense that it does. Xcode uses git now, and so it'd require that the OS - at least the developer tools, would have to have it. So it's not necessary for me to worry about updating this any more. It's nice to have a secondary source, should Apple decide to drop it's support, but I'm guessing that's not going to happen anytime soon.

Interesting stuff…

Exchange Timezones Hit Me Again

Friday, February 3rd, 2012

bug.gif

This morning it was brought to my attention that my ticker plants were showing the open on VIX, quoted out of the CBOE, as something different than the legacy feeds were showing. All my different feeds (dev, staging and prod) showed the same number, so I let the QA guy find out what the problem was. I had no idea where it was coming from. I wasn't even sure I was wrong.

So he asked one of the legacy developers, and sure enough, his numbers matched Yahoo! Finance. So it looked like there was something to this. So I started looking into the trade feed. Thankfully, I have a nice, stable, feed recorder and query service already going, so it was just a matter of giving it the right parameters and it would pull up the files, uncompress them, decode them, search them, and deliver me the results.

I had to admit, this isn't the first time I'd wished I had a web interface for this, but alas, I don't.

So I looked at the feed, and realized that at 8:30 am, the trades arrived, but they were not marked as 'valid' trades. This is odd because all Index "trades" are valid if they arrive after the open. And it was, after all, after the open.

So I looked at the code - and sure enough, there was the problem. The CBOE is the one exchange I listen to that's located in CST as opposed to EST. That means that the time of the "open" for the CBOE is 8:30 am, and not 9:30 am, like the NYSE, PHLX, etc. This is something I'd planned for, but hadn't remembered to use in this particular exchange codec.

The fix was simple - use the CST open, and all would be fine. Unfortunately, that means that the data for today for the indexes from CBOE is messed up, but at least it'll be right for Monday. Just all the little data things that need to be fixed up… it's getting to be fewer, but I'm sure there's still a lot to find.

Google Chrome dev 18.0.1025.3 is Out

Friday, February 3rd, 2012

Well… it's only been a few days since 18.0.1025.1 was released, but I guess the Google Chrome team realized that there were a few outstanding issues that warranted a new release and a few ticks in the version number. Nice that they all appear to be fixes for crashing bugs… way to keep on the crashers, guys.

This is Why Codecs Have to be Strictly Controlled

Thursday, February 2nd, 2012

bug.gif

OK… so I'm working on some nice little features on the greek engine today - adding a few nice IRC commands to the system, and making things just a little bit nicer for the support staff, and we start getting these odd problems. In some cases, the response time from the engine is wildly varying, and in other cases, the memory footprint is far too big. All very odd, seemingly unrelated, but all timed to happen today.

So I started looking at yet another problem - one of the clients to the engine is sending a Close Channel message when it wasn't needed. That, in and of itself, is not the problem, but on closer inspection, the contents of the message are alarming:

[asyncRead] a close channel for unknown channel was received
[asyncRead] 58 af cb a6 3b 49 db 41 03 84 4d 11 c4 5e 55 d1 05 45 12
            X…;I.A..M..^U..E.

the sec on line of the error message is the binary contents of the close channel message, which should contain an 'X', followed by the 16-byte channel ID, followed by an encoded variant value. In this case, the 'E' means it is an error, and by definition that means that a varint-encoded number follows, and after that, another variant that is the "reason'. The value after the 'E' is intended to be the numeric error code, and the variant is meant to hold the message or messages that accompany it.

But as you can see, there's the 'E', and a varint-encoded value, and then nothing. In my decoder, I look at the next byte and try to decode it. If that happens to be a String, or a Map, I can go off into lala land and decode a GB or two. Not good.

The solution? Well… there's two: we have to get the app (or app writer) that generates this malformed error to correct their mistake, and until this person can be identified, we have to protect our decoder against this kind of problem and put in a simple test:

  if (aPos < aCode.size()) {
    mErrorValue->get<1>().deserialize(aCode, aPos);
  }

The real problem with this is that someone created a codec that encodes data improperly. To what extent? I have no idea, but this kind of things is capable of bring down a whole lot of servers and clients. I'm lucky it's not been a lot worse. But it underscores the need to have a group of people that control these critical components, and not allow just "anyone" to fiddle with them. The risk and consequences are just too great.

I wish I had faith that this will be the catalyst to stop all this, but I have serious doubts about it. I'm certainly going to try to get it to create some change. It needs to happen, and it needs to happen now.

Bloomberg Gets a Pretty New Face

Thursday, February 2nd, 2012

GeneralDev.jpg

Yesterday, I heard about Bloomberg's new Open API initiative. It's a new .Net, C++, Java, and C API that is "Open" for all to use and make use of. The catch is that all the data you'd want to get is really still exceptionally expensive, but that's Bloomberg, eh? The last time I used a Bloomberg API it was the Bloomberg Server API, which was a mild modification on the old Bloomberg Terminal API that came with every Bloomberg Terminal - Windows and Solaris, going far, far back into the past.

I've just briefly scanned these docs, and it's a new API alright. Much easier to deal with, and hopefully far easier to decode the data once it's returned from Bloomberg. I like that they are trying to really make it easier to use - both in the pub/sub and the req/resp modes. It's an improvement.

Heck, almost anything is an improvement.

Still, the kicker is the cost of the data. When last I looked, it was still some of the most expensive data around. I mean outta sight prices. I don't think it's gone down in the last two years, but I could be wrong.

Yet I can't blame them. They have a nice gig - they have a great reputation on the street for their data, and so they can charge a ton and use that to keep away the riffraff. It's working for them, and who am I to give them grief. Sure… I'd love to build a system off this for the Mac and build in all the bells and whistles, but that's a really hard sell as the data is so expensive and all the online brokerages are giving their data away - with decent tools.

Still… if I hit the lotto, I'm all over this.

A Letter to a Dear Friend

Thursday, February 2nd, 2012

This morning I was thinking about the particular situation I find myself in at work. Interestingly enough, the one guy that I thought could really give me great advice is one of my oldest friends - Bret from grad school. I've known Bret since 1980 - that's more than 31 years now. We've worked together, laughed together, and lived a long time together.

To this morning, I wrote to him to ask him his advice:

I've been struggling here at work for the last few months - amid some massive re-orgs (yes, multiple massive re-orgs in that time), and in the midst of all this, I thought of the one person that I could really trust to give me some solid advice - you.

So here's what I'm struggling with: When I hired on here at The Shop about 2 yrs ago it was all about who I was going to be working with, and how we were going to be developing, and no more crap for HR… all the things that after a long stint at First Chicago, then UBS, I was happy to hear. It started out great, and my manager was just made partner, so it seemed like it was going to be great for a long time.

Then things changed. My manager, Clive, was put in charge of all IT for The Shop. Everything. And it's changed Clive. We no longer work together. For a while, I found someone that reminded me a lot of you - funny, easy to laugh, good coder, thoughtful. A really nice guy to work with. And while it was a little team of the two of us, it was great.

Then Clive decided that his view of IT needed to change, and that guy, is now managing the group I'm in - a group of 14 people.

Out the window goes the "who" I work with. Now I'm working with regular (which is to say, junior) guys that are dolts in comparison.

Out the window goes the "how" I work. Now things can't be released unless we have a meeting about it and it'e perfectly acceptable to leave bugs in production until that time. There are times they will have to check to see if it's OK to fix a bug - priorities are important, after all.

Out the window goes everything that I once liked about this place.

And so I'm asking you: How do you do it?

How do you work with people, systems, organizations, etc. that are clearly more like Roman galleys than places for creative people to work. It's not that I mind hard work, it's the conditions under which it's produced. Maybe I'm just fooling myself that a place like this Shangri-La even exists, but I'd like to think it does. But maybe that's my problem.

Maybe I need to just accept that people that want my effort, my energy, my work really aren't interested in my best work - they would be happy with 80% - if they get to choose the terms under which it's given.

Anyway, I'm hoping that you have some words of advice for me. Something that I can use to re-adjust my thinking, to re-align my sights - to get to a place that I don't dread coming to work.

Anything you have would be really helpful.

I'm hoping he's got some good advice for me. Stay tuned.

[2/13] UPDATE: I wasn't disappointed… his letter was right on target and it got me to thinking about what I need to do:

Hmmm, well, I think I should tell you a story. This is how my thinking has changed during the last 6 months of my last job. It has to do with all that's happened before but took a form I could articulate last year.

I started working for Avocent in 2008. It was a new team building a pretty cool product. Long story short, it was the best team I'd ever been part of. Best is terms of mutual respect, fun, and actual quality and quantity of output. Then we were bought buy a much bigger company. Things changed like black and white. One day when I was thinking about my options a light bulb went off. Every job I've ever had started out hopeful and for varying lengths of time was pretty rewarding. But something always happened to change that. What I realized was not that things always change. It was that *I* have been wrong every time about my estimation of the longevity of the job. Every time. On that day I made two decisions. Or rather two changes in my thinking. One is that I don't care one wit about the longevity prospects of a job opportunity I'm considering. Everyone tries to sell you and the vast potential of whatever they are selling. Now what I'm about to say will sound harsher than I really think in general (I mean I've not turned into a hopeless cynic, far from it) but to the job salesman I say bullshit. But really it's my desire to assume more than I should that I call bullshit on. Here's the deal. I've been wrong EVERY time. It's not that I didn't have educated assumptions, I believe I did. Doesn't matter. There are too many factors that can change. I NEVER saw the purchase coming by a company that was both large and insane at the same time. So, to be clear, I'm not jaded, I just don't consider longevity to be a factor. I just want to know if the work is interesting. If things change I'll look again. But I said I made two decisions. The second I'm still working out in real life. Since I can't count on others for long term job satisfaction, my goal has changed. I used to want to find a job that was "interesting" (there are many dimension to what "interesting"means). What I realized is the reality that I could continue this path of going from job to job (really meaning from employer to employer) as things change, to I could seek to become independent of that rat race. The best word I have for what my goal is right now is independence. There are just way too many ways today to make your own path and divorce yourself from the work you want to do and a bunch of other factors (where you live, who you work with, etc.).

I guess in answer to your question of how I do it, I don't think I do really. I've always moved on. That takes time sometimes, but the mental switch flips pretty easy and hasn't ever flipped back. In the meantime, be yourself, advocate the quality you expect. THAT is hard and I've failed many times but that's the standard to measure against. Remaining true, that is. This has been a bit of a ramble. There's probably more to say so feel free to call anytime. I mean it. I'm living this out everyday right now so talking this stuff through would be helpful to me too. It's been good for me to reflect on this as I've typed this much to you so far.

Take care and let me know how things go.

He's dead right, and I knew it before he even wrote back. The problem is me and my expectations. I need to lower them. Way, way, lower. When I was new here, and had lower expectations, things were a lot better, but as I started doing more work here, they rose on the hopes that things were really going to be great. Big mistake of mine.

Focus on the things that are important to me. That's the ticket. It's not important that I'm a convert to the cause, I just need to be a solid, good, hard worker, and that's always going to happen. It's when I think they have the same vision as I do that things go sour. I just need to keep a respectful distance. It's not easy for me, but it's important.

Thanks, old friend. I knew I could count on you!

The Realization that Things are Exactly as They Want

Wednesday, February 1st, 2012

Today has been a pretty big day for me. I've written a lot of code to try and get things up to date and operating at good speed, and there hasn't been a single thing that's stumped me. I got it all done, exactly when they needed it, and all in the branches they needed - merged into the right branches and ready to go. Can't complain about that at all.

But I've also been doing a lot of talking with my new manager, and yesterday I had a long talk with my old manager. I think I've come to the realization that this place is not in transition, at least not completely. It's evolving, and in that evolution, it's changing exactly how it wants to be changing, and my concerns about where it was versus where it is, and where it's going, are really all my fault. That is to say - all my issues.

This place is doing exactly what it wants. And more of it. It's moving away from the structure and people that I hired on to work with, and hired on to work like. I can write code anywhere, I came to this place to work with specific people. Now that's no longer possible. I came to work in a specific way - again, no longer allowed.

In short - everything I liked about this place in coming here is being pulled away, and it's very clear that it's not that my concerns aren't heard, it's that they don't seem them as concerns! They see them as achievements!

So where does that leave me? Well… it's a place I no longer want to be, but it'll take me time to find a place that, once again, appears to be the kind of place I want to work. I'm hoping the search isn't too long, but no matter how long it is, I'm here, and I'll play their games, but I can't honestly pretend to like it. It's nothing like what I want to do.

But for now, it's my paycheck.

But not for long.

Fantastic Difference in Keys for boost::unordered_map

Wednesday, February 1st, 2012

Boost C++ Libraries

Today I spent most of the day on a few issues - the biggest of which was the problem that a restart lost those Option calculation results that I was working so hard to retain. What I needed to do was to persist them to the redis cache server. No way around it. The problem was, I had nothing written for this, and a few other little testing bugs came up and I had to fix them as well.

The persistence system is modeled after all the other save/load persistence methods I have in the code. I have a save state method, and a load state method, and all I needed to do was to put the save method where I was saving other state data, and put the loader after I'd loaded up the instruments and before I continued on with the initialization of the system. In theory, not too bad.

In practice, I had to do quite a bit of detail work, but that was expected. It's non-trivial to persist a class that's not been fitted for persistence, but it's all straightforward. It just took time. No… the problems were after I had that code working, and I went to test it.

The original data structure I used to hold this Option calculation results was a boost::unordered_map:

  struct Results {
    uint32_t         volatility;
    msg::greeks_t    greeks;
    uint32_t         impVolatility;
    msg::greeks_t    impGreeks;
    // …and more fields like this…
  };
  typedef struct Results results_t;
 
  typedef boost::unordered_map<msg::ng::secID_t, results_t> ResultsMap;

The secID_t is a SecurityID - a 128-bit number that packs all the critical instrument identification information into one nice little package. This makes sense as a key in the map because it's unique, easy to use, and the core of a lot of look-ups in the rest of the system. Sounds great.

When I made the deserialization method for the data coming out of the redis server, it looked very simple:

  bool unpack( const std::string & aBuffer, uint32_t & aPos, ResultsMap & aValue )
  {
    bool       error = false;
 
    // first, let's get the number of pairs in the map
    int32_t   sz = 0;
    if (!msg::ng::unpack(aBuffer, aPos, sz)) {
      error = true;
    } else {
      msg::ng::secID_t  id;
      results_t         res;
      // now, for each pair, read them in and set them
      for (int32_t i = 0; i < sz; ++i) {
        id.extractFrom(aBuffer, aPos);
        res.deserialize(aBuffer, aPos);
        aValue[id] = res;
      }
    }
    return !error;
  }

I get the size of the map, then read in each pair and save it in the map. Simple.

Well… simple - yes. Fast? No.

The initial tests had this at 3 minutes for 88,000 pairs. That's horrible results. I did some profiling and it's in the one line:

        aValue[id] = res;

That makes no sense. Boost's unordered_map is the fastest around. I don't understand what's happening. So I did more digging. The next thing I found was that there wasn't an exposed hash function for boost to use. So I added it:

  namespace msg {
  namespace ng {
  size_t hash_value( const secID_t & anID );
  }
  }

and implement it simply as:

  namespace msg {
  namespace ng {
  size_t hash_value( const secID_t & anID )
  {
    return anID.hash();
  }
  }
  }

I expected this to speed things right up. The actual results? Same slowness. Amazing!

Maybe it's the security ID? Maybe it's be better with a Security Key - a std::string version of the same thing? Maybe boost is a lot more efficient dealing with string keys, even though they aren't more space efficient? Worth a try… we can't leave it at a 3 minute start-up.

So I went to:

  typedef boost::unordered_map<std::string, results_t> ResultsMap;

and then:

  bool unpack( const std::string & aBuffer, uint32_t & aPos, ResultsMap & aValue )
  {
    bool       error = false;
 
    // first, let's get the number of pairs in the map
    int32_t   sz = 0;
    if (!msg::ng::unpack(aBuffer, aPos, sz)) {
      error = true;
    } else {
      std::string  key;
      results_t    res;
      // now, for each pair, read them in and set them
      for (int32_t i = 0; i < sz; ++i) {
        if (msg::ng::unpack(aBuffer, aPos, key)) {
          res.deserialize(aBuffer, aPos);
          aValue[key] = res;
        }
      }
    }
    return !error;
  }

With these changes, the insertion of the values into the map went from 3 minutes to 61 msec! That's a factor of almost 3000x! Wow! OK… now I know. Don't use security IDs as keys in the boost::unordered_map. It's just not even close to fast enough.

Google Chrome dev 18.0.1205.1 is Out

Wednesday, February 1st, 2012

Google Chrome

This morning I noticed that Google Chrome dev 18.0.1025.1 was out and it looks like a few nice things for me in the release notes. SPecifically, the latest V8 javascript engine (3.8.9.0), and then a few nice Mac-specific things like fixed Lion gestures, fixed momentum scrolling in frames, and an issue about the devtools closing prematurely. All nice things to have. I'm really glad that aren't focusing on Windows and leaving Mac OS X a version or two behind. That would be sad.