a life of coding

Looking at the Facebook Patent

2010-02-26T10:26:00.003-05:00

I Am Not A Lawyer.

Facebook was recently granted a patent for "Dynamically providing a news feed about a user of a social network". This has generated much news, and has many people complaining about software patents. Unfortunately most comments look no further than the title.

The title is irrelevant. In fact, most of the text of a patent is irrelevant. Whenever you are presented with a patent, you should start with the independent claims. The claims are what you litigate (sue over), and the independent claims are the broadest. If you create a "social network news feed" which does not match one of the claims in Facebook's patent exactly (or equivalently), this patent has no bearing on you.

The three independent claims (1, 16, 24) are almost exactly the same, differing in the embodiments of a "method", a "system", and a "computer readable medium" and changing the grammatical structure to fit. Here is the first one, the others are trivially the same:

1. A method for displaying a news feed in a social network environment, the method comprising: monitoring a plurality of activities in a social network environment; storing the plurality of activities in a database; generating a plurality of news items regarding one or more of the activities, wherein one or more of the news items is for presentation to one or more viewing users and relates to an activity that was performed by another user; attaching a link associated with at least one of the activities of another user to at least one of the plurality of news items where the link enables a viewing user to participate in the same activity as the another user; limiting access to the plurality of news items to a set of viewing users; and displaying a news feed comprising two or more of the plurality of news items to at least one viewing user of the predetermined set of viewing users.
Infringing this claim requires (a) monitoring activity, (b) storing it in a database, (c) generating new items from the activity that you plan on showing users, (d) attaching a link to participate in the same activity, (e) limiting who can see them, (f) displaying more than one news item to a user who did not specifically ask for it.

I would say that any basic security monitoring and alerting system would be sufficient prior art, except for (e). The claim specifically mentions attaching a link from "attaching a link associated with at least one of the activities of another user to at least one of the plurality of news items where the link enables a viewing user to participate in the same activity as the another user". This seems an oddly specific limitation on the claim, which leads me to believe (given the massive prior art list) that the claim was not accepted without it. So, if you write a news feed that does not attach at least one link that does exactly this, you are not infringing this claim.

PS: You should should have realized by now that this patent has nothing at all to do with twitter. Or really most social network feeds.

[ more comments at Hacker News ]

Skirt the AppStore: Run Unsigned Code on a Stock iPhone / iPad

2010-01-28T13:19:00.004-05:00

Another iPhone OS product, another flame-war over Apple's "iron fisted control". I think the whole argument is pretty silly, and to hammer it home I gave this post a title that's sure to anger lots of people. Why? Because what I'm about to say is (a) obvious, (b) correct, and (c) still not what people want to hear.

You can run arbitrary code on an unmodified iPhone, and the same will be true of the iPad. You will still have certain restrictions (only one app at a time, no access to the data in other apps, and limited access to hardware), but rarely do people complain about those. What people do say is that "Apple will stop me from doing X, because X interferes with their business model, or angers the carriers." Examples are Skype over 3G, Google Voice... really, there aren't that may reasonable apps that have been turned down on the AppStore, but as I said, this is a flame-war and "reasonable" isn't part of the vocabulary.

You can run whatever your little heart desires on your iPhone/iPad, and it only costs you $99 more. This product "upgrade" is called the Developer Program, and has been available for quite some time. After paying the fee, Apple lets you download a piece of software that lets you run almost anything that you like (I already listed the restrictions) on your iPhone. Done!

Whats truly amazing is that this is always how desktop development has always been done, except on Linux and Mac OS X. Pay your money, get your compiler, write your software, run your software. The Linux and Mac OS X crowd is up in arms because they have had GCC for at least 9 years, and $99 seems like a lot compared to free. Where things differ is what happens after you have written a piece of software that you like and runs correctly. Traditionally, you make the binary available and people can click it and have it run (insecurely) on their computer. With the iPhone OS, you send it to Apple, they make sure that it fits some minimum criteria, and now you can require people to pay you to use this software. The end user even gets a convenient update mechanism, and a way to re-acquire your application without re-paying should it be deleted.

Want to be a revolutionary? Here's some advice: stop complaining, join the developer program, and make something people want.

The Most Amazing Error Condition

2010-01-21T14:53:00.002-05:00

I stumbled on a dumbfounding message today. It said: "The application Preview can't be opened -10810." I saw another one later that said: "The application TextEdit can't be opened -10810." I have seen strange errors when upgrading (I assume due to pieces of libraries cached in memory differing from the remaining code on disk), but I had not upgraded recently. Thinking that I may have run out of disk space, I decided to check "df -h". Opening a new Terminal window just said: "Could not open a new pseudo-tty."

"Huh. Thats odd..." A search brought up some old Linux threads mentioning a maximum number of processes-per-user. I couldn't think of any reason why I would have too many processes, so I went to an existing shell, typed "ps -ax" and got "fork: Resource temporarily unavailable." Further adding to the awesomeness of the situation. But, I had a possible out - if there were indeed too many processes running, I could sacrifice some and the problem would (temporarily) go away.

I did manage to run a "ps -ax", and I saw a large number of:

  502  4344 ??         0:00.00 (Google Chrome He)
  502  4432 ??         0:00.00 (Google Chrome He)
  502  4433 ??         0:00.00 (Google Chrome He)

... zombie Google Chrome processes! Hah! I have been running Chrome for long periods of time, and its process management system is not functioning correctly. Close Chrome, problem solved.

Man, that was a weird one.

Paul's Mistake (Be Strong, Not Whiny)

2009-11-19T16:54:00.007-05:00

Paul Graham recently commented on the state of Apple's AppStore and their treatment of developers. The comment seemed uncharacteristic of him, dedicating a lot of time to chastising Apple for their actions and claiming that this will substantially tarnish Apple's reputation. I respect Paul, but feel that his tone is wrong. Only time will judge whether Apple's treatment of developers will keep them from being successful. Instead of complaining, we should be analyzing how Apple has been successful and attempt to reproduce that on our own. Its the longer, harder path, but it will make all the difference.

(continuing with the previously titled "Developers: You Are Whats Wrong With the iPhone AppStore")

You pounded your fists because Apple didn't have a "real" SDK, so Apple created one. Then you pounded your fists because the approval process is too slow, so Apple hired a bunch of noobs. Now you're pounding your fists because the newbies aren't consistent in their execution.

Stop. Stop making iPhone apps. Stop complaining about Apple. Stop using their phone. Stop.

I am a long time Mac user, and I will tell you that nobody cares about your app, least of all Apple. I didn't buy the iPhone because of you, I bought it because of Apple. Apple put a bunch of cool things on it, and they all work (mostly) great. Most AppStore apps sell for a dollar, because thats about all they're worth. (BTW: please disprove this by creating something valuable.)

Since Apple made a great phone, they have become popular. But Apple isn't good at popular. Popular people have to bend to the will of others to stay popular, and that just isn't Jobs, and it isn't me either. Apple wants to be the best, and you have to be a little elitist to do that. You have to somewhat ignore what everyone wants and concentrate on what is best. Personally, I'm okay with this, and so are most of the people who bought an iPhone.

When Apple releases a tablet, will your iPhone app work on that? Of course not. What about Google's Droid? Not there either. Your apps will be rewritten (and re-tested) on every platform there is. The reason that we were all making web apps before the iPhone was that web apps actually work everywhere. JavaScript+HTML is the data interchange format for executable code (delivering what Java was supposed to). Don't like Apple's phone? Get a new one, your email, photos and web applications will continue to work there. Man that sounds nice, doesn't it.

If Apple is pissing off developers, I personally think this is great. Instead of building yet another iPhone app that Apple has to test and only your friends will ever use, you'll think about building your own hardware, or an app for another phone, or a web app that will be usable on any device for decades. Better yet, build a WebAppStore that works exactly like the AppStore, but for web applications.

You can pound your fists all you want, but only a couple of your peers are even listening.

[Discuss at Hacker News]

Cleaning MacPorts Dependencies

2009-09-02T16:33:00.007-04:00

Sometimes MacPorts gets a little carried away with dependencies. I recently tried installing mercurial on 10.6, and found MacPorts yak shaving various X11 libraries. After deciding that this yak didn't need to be shaved, I cancelled the install and downloaded the Mac OS X mercurial installer (index).

Unfortunately, canceling a port install leaves behind extra packages. Which packages? Good question. I knew that I had asked for some packages (libidl, graphviz, bash-completion), but I didn't know which packages were only installed as dependencies of my half completed mercurial request. Given a package, port can find the packages that are dependent on it. Hmm, this sounds like some programming homework...

Given a map of package names to a list of packages dependent on that package:

find packages with no dependencies, which were not explicitly installed by the user
if there are none, you are done
1. add them to a list of packages to remove
2. remove their map entry
3. remove them wherever they appear as dependent on another package
recurse!

Google Wave isn't Email, it's Facebook (and Twitter, and AIM)

2009-08-10T14:29:00.005-04:00

Google Wave is an upcoming product form Google, and has been described as a replacement for email. In the demo at Google I/O, real-time interaction was pervasive. Data in the format of emails, instant messages, tweets, and individual chat keystrokes were shared between multiple people at the same time. Participants were added in very granular ways - a whole chat, a paragraph of an email, an ongoing conversation starting from a specific point. Everyone proclaimed that Google Wave was The New Email™, and would be used as such.

Possibly, but Wave is far more powerful than that. At the heart of Wave is Jabber (XMPP), a technology that few people know the importance of. Some people might recognize it as the transport for Google Chat, but the real killer application of Jabber has been secure, real-time, in-house chat at financial and government organizations. Jabber is an efficient way for people to selectively share information with lots of people without needing a central server owned by an untrusted company.

Jabber has not been a breakout success, likely because it cannot be used via a web browser, and instant messaging has a strong network effect. Google Wave exposes Jabber to web browsers via a custom Jabber server, AJAX, and the Google Web Toolkit. To combat the network effect, Google integrated their email service and then showed how two people using Wave would have an improved email experience. This embrace and extend approach will probably be the primary growth mechanism for Wave as a technology, and Google as featured API's for embedding Wave into existing web pages, and providing external services to Wave users.

With a web browser interface, a Jabber back-end, and well documented extension API's, Wave is extremely useful. Write a robot for Twitter (which Google has already demonstrated as Twave), Flickr, and Blogger (also demonstrated by Google), and you recreate the core features of social media sites like Facebook. In fact, all that you would need to finish your proto-Facebook is some access permissions on the data (already part of Wave) and a mechanism for managing who has access to which data. Any data stored out on the external services (Twitter, Flickr, etc) will have their own permissions schemes, but data stored in Wave will be visible to people that you choose, just like Facebook.

Your friends might not even have an account on the same Wave server, but due to Jabber federation, thats okay. One convenient reason for telling people that Wave is like email, is that people will add their server name when giving out their account, just like an email address: ynniv@ynniv.com, or ynniv@mac.com. GMail users are used to giving out their email address for all sorts of things: email, instant messaging, voice and video chat, document collaboration.

If Wave turns out to be as important as I hope, people will stop differentiating their social networking account from their email account, and help break up the centralized control of personal information in the process. Back before the bubble, we thought that the Internet was going to be a decentralizing, democratizing force. Everyone on the Internet could send or receive data from anyone else. Email, and the web were products of this vision. Social networking and instant messaging has taken that away from us, placing all the control and everyone's private data in just a few hands. Jabber was created with that original decentralized vision, but never overcame the strong network effect in the instant messaging world. Google Wave could be the sugar to help the medicine down, finally bringing decentralization to the social web.

The Wave Way IS the Web Way

2009-08-10T13:09:00.003-04:00

Anil Dash at Lifehacker recently wrote that Wave would face difficult adoption due to complex and rigid APIs. He says that the web has incremental upgrades, has a "weekend-sized" barrier to entry, has value independent of the network effect, and is easy to understand an explain (a duplicate of the second point, really). By comparison, he thinks that wave is big and complicated. It is my opinion that Wave applications will be similarly sized, similarly complex, and not require your friends to join at all.
He starts off by saying that Wave is composed of the following technologies:

Federation (XMPP)

The robot protocol (JSONRPC)

The gadget API (OpenSocial)

The wave embed API (Javascript)

The client-server protocol (As defined by GWT)

and that "combining all of these pieces would just be the starting point" to development. This sounds to me like claiming that developing a Google Maps application involves working with NavTeq and writing tile rasterizers, or that making a website requires writing an OS, a web server, and a web browser. Here's what people will use to develop with Google Wave:

Make a wave view appear in your web page via the embed API (Javascript)

and / or

Make a view or interface in Wave using the gadget API (OpenSocial)

and / or

Provide outside data or services to Wave with the robot API (JSONRPC)

All of them will have demo code that you can understand "in a weekend on your couch with a beer", or however else he thinks that people develop Web 2.0 apps.
Dash instead advocates a different API that does actually require you to host your own servers and federate over XML-RPC. The reasoning is that its easier for people to write their own server with XML-RPC than XMPP. XML-RPC is less efficient than Jabber (XMPP), so for intensive applications, should we be worried about performance? From the pushbutton page:

Scaling issues? There will inevitably be some learning to do about how to scale the resource-intensive hub layer of a Pushbutton system. But because the hubs live on cloud systems that make enormous amounts of computing resources easily available, because the coders creating the reference implementations of the hub software have great experience making web-scale systems, and because it's relatively simple to introduce new hubs as needed, this will likely not be a gating factor for adoption of Pushbutton. Worry? No

I think that last statement should read, "Worry? YES". Instead of using a properly written, efficient server, we should all run home brew, inefficient servers on huge clouds, because computing power is free! Google is going to release the Wave server open source, and its going to be really easy for you to run theirs on your hardware. Web programmers don't worry about writing web servers, and Wave programmers should not worry about writing a Wave server either.

Wave Is The Web

Wave actually is the web, and people should stop comparing it to email. The Wave interface that Google demonstrated was a web page. It communicated with the browser via AJAX. A Wave interface element ("gadget") can be embedded into any web page, providing immediate functionality to anyone visiting that page, no account required. You will get additional functionality by creating an account, but there is still inherent value in receiving fresh data, and interacting with services in real-time, anonymously. There is no intrinsic network effect at play here. I predict that Wave is going to show up embedded in web pages everywhere in the form of live Twitter feeds and real-time collaborative features. No one has to know, they just keep doing what they're doing.

Improving Common Tag : Worse is Better

2009-06-12T10:54:00.005-04:00

Common Tag is an "open tagging format developed to make content more connected, discoverable and engaging" [commontag.org]. It mixes RDFa into XHTML to add metadata to specify metadata for a content block, most importantly a link to a common database entry that iconifies the topic like the one word tags used in many places. This is an improvement over word tags, which can be non-descript or ambiguous: does "apple" refer a fruit, a computer company, a record label, or someone's name? Tags that are acryonyms may have no meaning to the user. RDF is commonly referred to as the Semantic Web, because it helps computers link concepts together. Everyone wants the Semantic Web, but somehow it never happens... maybe its because RDFa looks like this:


<body xmlns:ctag="http://commontag.org/ns#" rel="ctag:tagged">
    <span typeof="ctag:Tag" rel="ctag:means" 
         resource="http://rdf.freebase.com/ns/en.u2"/>
</body>

This is a very explicit piece of data. Much of its content is XML support structure. The semantic knowledge contained in there is:

the tag for this span is "en.u2" in freebase

The structure contained in there (removing any content) is:

xmlns:ctag="http://commontag.org/ns#"
rel="ctag:tagged"
typeof="ctag:Tag"
rel="ctag:means"
resource="http://rdf..com/ns/"

There are a lot of things that can break without actually removing any semantic meaning. If there were a typo anywhere in the structure above, your tag would be hopelessly borked - all that work (and bandwidth) for naught. More importantly, this format says that if I want people to understand my tags I have to embrace XML, and there are few things that I dislike as much as XML. Look at all the structure required because of XML, and the complicated tools that are required to manipulate XML!

Lets take that content and put in something sexier: simple HTML. Over at Hacker News, someone suggested

<p ctag="wikipedia/The_Beatles">We're talking about The Beatles here</p>

I like this direction, but its lacking in a gruesome way: ctag is not a valid HTML attribute. Browsers may not like it, and they certainly won't be able to read it, so it isn't as clean as the RDFa. How about this:

<p class="-ctag-wikipedia-The_Beatles">We're talking about The Beatles here</p>

This is valid HTML, browsers can operate on this content, and you can even style it! This is much better than the previous suggestion (which was quite good, and spurred me to write this article), but is it better than RDFa?

Content:

the tag for this span is "The_Beatles" at Wikipedia

Structure:

class="-ctag-"

It is by far more concise than RDFa, but it has limitations - the tag content has to be valid inside a CSS class, which means alphanumeric, dash, and underline. There is additional flexibility if you use backslash, but this is uncommon in CSS classes and may not play nice everywhere. Most significantly, it doesn't include a link to wikipedia, only the name, and Semanic Web people really dislike that. I suspect that most people will link to Wikipedia, and if not, a search engine can figure out the most likely host. I mean, how "smart" are your tools when they can't deduce the meaning of wikipedia? If you're using an internal host or very specific database, you can always fall back to RDFa.

Worse is better. The CSS tag format I propose is not as specific as RDFa, but it is easier to implement, harder to mess up, works with non-XHTML, and easy for humans to verify. These generally overlooked and undervalued qualities make adoption easier for people, which is in the end all that really matters.

The NYTimes Doesn't Understand Social Networks

2009-05-04T11:53:00.003-04:00

In "Tinker Away, Facebook Says", the Times summarizes Facebook's recent API expansion announcement with a comparison to a grocery store that "props open the front door and invites everyone to come in, take the merchandise free of charge, and then give it away themselves". They call this "the counterintuitive business wisdom infecting Silicon Valley these days", and proceed to imply that Facebook is doing this from the kindness of their heart.

Let me suggest that Facebook is not a charity but a profit driven company, seeing the consumer primarily as an unavoidable obligation. An API expansion attracts additional users and (more importantly) encourages current users to not leave. A permissive developer API allows more people to participate in innovating their platform for little expense to Facebook. These companies not affiliated with Facebook take risks in developing features that users might want. If users like these features Facebook can incorporate them into the product, otherwise they will wither and die without tarnishing the Facebook brand.

Developers see this as a way to capitalize (however meagerly) on the success of a powerful brand. Consumers see it as a new channel to cater to their needs. What would be better for both developers and consumers is a federated system with non-centralized intellectual property ownership, and thats exactly what Facebook intends to prevent. The most important thing that you should know concerning any Facebook API is that one thing remains the same: Facebook owns its customer's data, and no one, not even the customer, is allowed to export that data from them.

To an extent, this bothers customers. Facebook recently changed their terms of service to say that Facebook owned the property rights all user entered content. Since this would have been catastrophic to artists and writers, there was a substantial backlash that resulted in Facebook rescinding this clause. They additionally apologized and claimed that this was never the intent. It seems unlikely that they would attempt to own all user content in this way, and I believe that they were honest in saying that this was a mistake. What the terms should have said is that Facebook claims ownership of the format and
context of data as it appears in Facebook. You own your posts, but the only way to use them outside of Facebook is to retype them by hand. You own your photographs, but maybe only if you have the originals. And under no circumstances can you export Facebook's bread and butter: the relationships that you have established.

And so Facebook's goal is to make you just happy enough that you won't jump ship. They will claim as much ownership of content that they can for the sole purpose of making it hard to leave. They will make it difficult or impossible to remove content. They will sell as much of your identity as advertisers will buy without the government getting involved. These are what the investors of social networks talk about, and the only way that they can survive as free services.

I am sad to see the Times miss what is blatantly obvious to me. Is it not the age old motto of investigative journalism to follow the money? It may be that our journalists have lost their curiosity. Or possibly that people have generally become consumers, and consumers see themselves not as powerful individuals but as reflected in their relationship to companies. With Social Networks, the money is in owning customer data as much as possible, and selling that data (via keyword advertising, or outright) to advertisers.

If I impress anything on you, let it be that Facebook needs you more than you need Facebook. They need you to buy products so that they can sell advertising, because Facebook is not really a free service. Years from now, the person who owns your past (photographs, notes, messages with friends) will be the one paying for it now, and if you are a Facebook user, that person is not you. The best thing that you can do for your future self is to get involved and start owning your present identity. Data in a product like Wordpress, Blogger (but not Blogspot), or simple HTML on a web server has been paid in full. There is no question about who owns the rights to your past, or what you are allowed to do with it - it is simply yours. Certainly, Facebook has extra features that you cannot get from these truly free products, but comparable features won't exist in other products (free or consumer paid) until there is a demand for them.

So get out there and encourage people to think about the past of the future, which might be tomorrow. Maybe even use Facebook to spread the word - but don't be surprised if they shut you down. After all, they are the ones footing the bill.

Firefox: An Acceptable Cross Platform GUI Toolkit

2009-01-07T13:47:00.002-05:00

A recent Hacker News submission asked what an "acceptable" cross-platform GUI toolkit would be. It discusses common complaints about the existing toolkits GTK+, Qt, Tk, and wxWidgets. It suggests creating a new toolkit with the qualities of being written in C, keeping it simple, LGPL licensing, easily skinnable for different OS's, binding for scripting languages, and be "simple and easy to use".

Good news! I know of an existing GUI toolkit that fits this bill. Its written and extensible in C/C++, has liberal licensing, is very skinnable, has bindings for JavaScript, Python, Java, Perl, and Ruby, and lots of existing open source code that you can learn from. It even has a large, mixed platform install base, so you know that the bugs are minimal. Its based on an object brokering system called XPCOM, and an application framework called Mozilla, but often goes by the name Firefox. Given the popularity of web applications in the startup community, Firefox seems like an obvious option for desktop apps. Yet it hasn't caught on, and I'm not sure why.

I recently worked on a desktop application for my day job. Given their limited interface goals and a game-oriented 3d engine, they had decided to write the interface directly in OpenGL. It was difficult to design interfaces like that, so I did a prototype integration with wxPython. That worked well enough to start real product development, during which I realized that wxPython had many of the "cross-platform toolkit" woes like needless internal complexity, difficulty in writing new components, and general ugliness.

I spent some time contemplating how people wrote successful cross platform desktop applications, and the two things that struck me were (a) there aren't many, (b) except Firefox, (c) and this might underly the popularity of web applications. So I did a new prototype using Firefox with a XUL user interface. What I found was that Firefox worked out great as a standalone application with modular code design, but it was difficult to write good interfaces in XUL. XUL seems to be well tested only as far as the functionality in Firefox is concerned. We switched to Ext-JS, started pretending to be web app devs, and got down to implementing features.

Writing a desktop app based on XULRunner has some interesting side-effects. For one, it makes retargeting your code as a web application or Firefox plugin really easy. It also lets you develop using a variety of hacker friendly languages. Like programming in lisp, it changes your perspective on how applications should be developed, and makes questions like "which UI toolkit is the best?" seem fundamentally flawed. In an age when Apple is rewriting their apps to run in a browser (and still look and feel like desktop apps), isn't writing a code against a desktop UI toolkit fighting the tide?

JavaScript Performance

2008-12-10T13:48:00.007-05:00

There are a lot of JavaScript performance benchmarks flying around out there. Some claim that Chrome shames the competition, others say that Firefox 3.1 is neck and neck with Chrome, others crown WebKit as the fastest of them all.

But what they never seem to do is compare them against other languages. One website, the Computer Language Shootout does, but it never seems to rate JavaScript particularly well. There are a few reasons for this. JavaScript has gotten a lot of attention recently and has thus been rapidly improving, and the CLS doesn't update very often. Tests are also written by different people, and folks who write JavaScript have never been the performance oriented crowd. Finally, the command line harness they use to execute JavaScript isn't representative of how users execute JavaScript.

So, what happens if we try to mitigate some of these? I picked a single test, rewrote it to run as a web page, and ran it using the latest version of browsers that can be considered stable. The results might surprise you.

n = 10

gcc	0.5s	1.0x
java6	0.7s	1.4x
java5	1.1s	2.2x
webkit	1.8s	3.6x
jsc	4.7s	9.4x
minefield	6.4s	12.8x
firefox	19.4s	38.8x
rhino	22.1s	44.2x
python	33.1s	66.2x
rhino	47.8s	95.6x
ruby	58.3s	116.6x
webkit [parallels/xp]	1.0s
chrome [parallels/xp]	2.0s

n = 11

gcc	5.1s	1.0x
java6	6.5s	1.3x
java5	12.6s	2.5x
webkit	23.4s	4.6x
jsc	57.5s	11.3x
minefield	81.2s	15.9x
webkit[parallels/xp]	15.0s
chrome[parallels/xp]	29.0s

Timings done on a Mac Book Pro 2.4 Gz Core 2 Duo / 4 GB RAM. Parallels/XP means running on Windows XP SP3 inside Parallels on the same computer.
WebKit is nightly build 39090.
WebKit [parallels/xp] is nightly build 39088.
jsc is built from svn rev 39090.
python is 2.5.1.
ruby is 1.8.6.

So, interesting things:

The fastest JavaScript implementations are close to the speed of Java.

The coming JavaScript implementations are substantially faster than Python or Ruby.

WebKit is about twice as fast as jsc (the command line interpreter), so the Computer Language Shootout numbers will be inflated.

Java 6 is substantially faster than Java 5.

So, I expect to see more web services written in JavaScript. Why? The argument for Python or Ruby has been that they are much more productive than Java/C/C++/C# that the performance of these languages isn't important. I certainly agree with this. However, JavaScript has about the same level of language productivity, and now has an implementation thats 18 times as fast as python and 32 times as fast as ruby. And you can use the same language across the board for web apps.

I also expect to start seeing desktop apps written in JavaScript. Why use a hacky python or ruby desktop app wrapper when you could use the best cross platform GUI kit there is? And, why bother with a local rails or django instance when you could do everything in a full MVC AJAX kit like Sproutcore or Objective-J?

At the risk of giving away the secrets to my sauce, desktop JavaScript is going to change everything. There are already a couple of frameworks out there (Jaxer, Titanium), but I think that there will be many more to come.

(PS: Lisp FTW!)

Java Static Class Object Fields and Weak References

2008-08-04T16:01:00.005-04:00

In browsing the Mozilla Rhino code, I stumbled upon a garbage collection problem that is truly bizarre. Rhino uses thread local storage for the current Context object. TLS uses a weak dictionary, so it is important that there is no strong circular reference to the key. Years ago (CVS r 1.234), the rhino class Context used to have a static reference to the thread local storage object. This was a problem because even when an array is empty, it has a reference to the class of objects it can hold. Thus, the TLS key (the ThreadLocal object itself) is referenced by the Context class, which is referenced by the (empty) Context[].class, which is a value in the TLS map! This circular dependency caused rhino to leak memory whenever a thread that used JavaScript completed. The solution to this was to change the value to an Object[]. When there are Context objects in the array, the reference still exists, however once the array is empty, there is no longer a reference and the TLS weakref dictionary will cut that key/value pair loose.

The comment, original, and fix.

Wow, thats a lot of detail. Here's the summary: if you have static class fields, make sure they don't eventually point to the key of a weak dictionary. If find this unavoidable, you can break the loop by storing a value as an Object and casting any usage of it. You will still have to set this variable to null in order for GC to happen.

Firefox 3 is the Ugliest Browser Yet

2008-06-17T16:00:00.003-04:00

It's a bit trite for a blog post, but seriously, Firefox 2 looked way better than 3 on Windows. If you customize the toolbar (via right click) to use "small icons", its marginally better, but people actually spent time working on the design of this UI, so I shouldn't have to fix it for them. Everyone would have been much better off with the third party Firefox 1.x theme Charamel.

I haven't used the Mac version yet, but it appears to have the same heinous icon problems. Look at Safari and try again please.

XPCOM nsISupports Proxy Crashes

2008-05-29T17:39:00.003-04:00

In building an application based on XULRunner, we've uncovered a number of bugs in the Mozilla codebase. One that really bothers me is the implementation of asynchronous object proxies. First, here's a brief overview of nsISupports proxies.

XPCOM

XPCOM is a generalized system for calling methods on objects. These objects can be implemented in C++, JavaScript, or Python (using a 3rd party library called PyXPCOM). XPCOM is the mechanism for defining Firefox extensions and writing applications that use XULRunner.

Working With Threads

One handy feature of XPCOM is the ability to have your method call execute in another thread. One example of this is a background thread that wants to do something to the UI, such as update a progress bar. Similarly to Java, Cocoa, and some Windows programming, UI access in Mozilla has to happen on the main thread. Another common situation is sending commands to a worker thread from the main thread. In both cases, we have a simple command to issue, but a direct method call is the wrong way to accomplish this.

So XPCOM has a mechanism for defining proxy objects that look identical to the original object, but executes your request in another thread. There are two flavors of this, synchronous (PROXY_SYNC) which will block your thread, and asynchronous (PROXY_ASYNC) which queues a request in the other thread and continues on. One implication of asynchronous dispatch is that there will be no return value, since you don't wait for the method to finish.

The Async Warning

The page describing proxies contains a very specific warning: if you pass variables by reference and those variables are on the stack, bad things will happen. It turns out that this is a bit understated, and doesn't convey some of the subtleties, so I will try to clarify.

Asynchronous XPCOM Proxies Are Fundamentally Broken

Using them will likely cause you nothing but pain. This pain will be in the form of totally random crashes, with meaningless (but seemingly useful) stack traces. You will ask people for help, and they will have no idea where to start. Unless you are intimately familiar with the XPCOM message dispatch and QueryInterface source code, PROXY_ASYNC should be a keyword which in your mind expands to Bad Bad Evil Crashing.

Why? Well, it turns out that return values are implemented as "out" parameters, which is a fancy way of saying references on the stack, which you may have recently learned is something that you should never mix with PROXY_ASYNC. So, if you were to call any method on an asynchronous proxy that could return a value, it will corrupt memory. There's actually no uncertainty here, XPCOM will write over some random memory address which is probably on some poor thread's stack.

The Workaround

"Ah!" you say, I can use PROXY_ASYNC and still sleep at night as long as I only call methods that return "void". Well, you might sleep a night or two, but soon you'll realize that bad things are still happening. The reality is, there is no workaround (sorry!). Even if you avoid methods with "out" or "in out" parameters, and ones that return values, Mozilla will still punish you by calling exactly these sorts of methods internally. If you're using a scripting language (generally JavaScript), things are even worse for you, because it makes a bunch more of these calls to set up your script object.

The Documentation

This bug is currently documented in buzilla. Unfortunately, there seems to be no workable solution to the situation. On top of that, the people responsible for the code base don't know why anyone would even want to use asynchronous dispatch. So, that doesn't sound good. The only real fix is to find a way to remove any mention of PROXY_ASYNC from your code. Depending on your application, this may be an easy fix. So far for us, this has not been the case. (Mozilla was never intended to do what we're doing)

FYI: JavaScript is Lisp

2008-01-04T12:44:00.000-05:00

Just a heads up in case anyone hasn't figured this out yet.

Some Things Never Change

2007-11-17T17:05:00.000-05:00

I just finished watching a half hour video on computer graphics. It covers the basics of 2D and 3D graphics and basic animation in a way that the average person can understand, but with enough detail to provide basic understanding.

The information itself isn't particularly blog-worthy; there are plenty of resources for learning about computer graphics. There are two reasons that this is interesting to me. First, television seems to have a dwindling amount of educational content. I remember watching 3-2-1 Contact, Mr. Wizard, and NOVA as a kid. The programs currently on PBS don't seem to show the same interest in science, substituting fantasy, social interaction, and ironically, reading. This may be related to my perception that they are also oriented to a younger audience, but I can't help but feel that kids should be learning these things instead of watching TV, not by watching it.

Second, this video is from a program called For All Practical Purposes and was filmed in the late '80s. The computer systems used are from Symbolics, a company that made their own hardware and operating system written entirely in lisp. This idea is similar to the current Squeak project, which is written in smalltalk instead of lisp. The video is old, the computers are old, the technology is old, and yet, it isn't particularly different from the basic concepts that I learned at Georgia Tech a few years ago. The biggest changes have been in the hardware that allows us to animate using physical simulation, and have realtime rendering.

Modern computing doesn't seem that different from 80's computing, except that everything is faster, larger, less efficient, prettier, more connected, and cheaper (much, MUCH cheaper). Just today I learned of a London startup that is writing games that would have run on a 1.2 MHZ Atari 2600 in Macromedia Flash - which uses most of my 1,200 MHZ CPU. But ultimately, its all the same. In my eyes, this makes a stronger case for the idea of Computer Science, because if you ever really learned how computers work, you might understand them forever.

parent and parent and proto, Oh My!

2007-11-16T15:40:00.000-05:00

Sometimes Javascript development can be rather scary. Unlike C or Java, which have well known standards, Javascript (like HTML) is often described by what works on one or more of the current implementations. On top of that, few people see Javascript as a legitimate language, and a vast number of novices are out there writing code that would make your skin crawl.

In short, if you are a knowledgeable programmer, quickly picking up Javascript is much harder than it should be. The syntax is C-like (generally regarded as a plus), the functions are first class (almost universally positive), and the object system is prototype based (at least smalltalk people like this). Soon there will be a common just-in-time compiler, and some people have suggested that it might displace currently popular languages. Personally, I wish that it ditched the prototype object system for a Python-like one, and had lisp-like macros, and just-in-time compilation can't come soon enough.

When a question about a Javascript feature like "x.__proto__" arises, it can be difficult to answer. In my experience so far, the best places to look are MDC and the IRC channel listed on the page.

What is __proto__?

When you make a Javascript "class" (ie constructing function), you can specify a prototype:

function Example() {
  this.foo = 1;
}

Example.prototype = {
  sample : function memberFunction() {},
}

This attribute is copied onto constructed objects as __proto__, like so:

ex = new Example();
if (ex.__proto__ == Example.prototype) print("same!");

__proto__ is a special attribute that is searched if an attribute is not found on the object, it is analogous to a parent class.

parent and __parent__ are substantially different. parent comes from the DOM, and is the tag that encloses the current tag. __parent__ is a special variable that is a reference to the scope that created the object. Like __proto__ is checked when a variable lookup fails. Where __proto__ is checked when the failure looks like this.missing_attribute, __parent__ is checked when the failure is in the form missing_global. According to Mozilla, __parent__ and __proto__ have been deprecated, which is unfortunate since they can be quite useful.

Debugging Python

2007-11-05T23:41:00.000-05:00

Above all else, my greatest annoyance with python is the lack of good documentation and defaults. I bet that there is a group somewhere that knows everything there is to know about python... I beg of them, write that knowledge down!

Here's an example. I want to debug a faulty program:

x = [1,2,3]

def foo():
sum = 0
for i in range(4):
 sum += x[i]

import pdb
pdb.run("foo()")

I'm then prompted by pdb: (> replaced with ])

] [string](1)?()
(Pdb)

Good, now I'm in the debugger. To get things started, I type "c", return, and get:

Traceback (most recent call last):
File "[stdin]", line 1, in ?
File "c:\Python24\lib\pdb.py", line 996, in run
Pdb().run(statement, globals, locals)
File "c:\Python24\lib\bdb.py", line 366, in run
exec cmd in globals, locals
File "[string]", line 1, in ?
File "[stdin]", line 4, in foo
IndexError: list index out of range
]]]

You should notice that the last line is the standard python prompt (with the aforementioned character replacement to make blogger behave), not the debugger. I have finished my debugging session due to an error. Gee, it sure would have been nice to debug that, since my state is now lost! I was already using the debugger to run this code, why did it not catch this exception?

It turns out that exception handling is done by sys.excepthook, and pdb.run doesn't set the excepthook. Some searching turned up two options. The first is simple but crude - add the following to site-packages/sitecustomize.py:

Thomas Heller

import pdb, sys, traceback
def info(type, value, tb):
    traceback.print_exception(type, value, tb)
    pdb.pm()
sys.excepthook = info

The second is more sophisticated, and checks for interactive mode:

ActiveState Python Cookbook.

# code snippet, to be included in 'sitecustomize.py'
import sys

def info(type, value, tb):
   if hasattr(sys, 'ps1') or not sys.stderr.isatty():
      # we are in interactive mode or we don't have a tty-like
      # device, so we call the default hook
      sys.__excepthook__(type, value, tb)
   else:
      import traceback, pdb
      # we are NOT in interactive mode, print the exception...
      traceback.print_exception(type, value, tb)
      print
      # ...then start the debugger in post-mortem mode.
      pdb.pm()

sys.excepthook = info

Here's what I'm currently running:

# code snippet, to be included in 'sitecustomize.py'
import sys

def info(type, value, tb):
   if (#hasattr(sys, "ps1") or
       not sys.stderr.isatty() or 
       not sys.stdin.isatty()):
       # stdin or stderr is redirected, just do the normal thing
       original_hook(type, value, tb)
   else:
       # a terminal is attached and stderr is not redirected, debug 
       import traceback, pdb
       traceback.print_exception(type, value, tb)
       print
       pdb.pm()
       #traceback.print_stack()

original_hook = sys.excepthook
if sys.excepthook == sys.__excepthook__:
    # if someone already patched excepthook, let them win
    sys.excepthook = info

The original ActiveState script doesn't debug if you are running in interactive mode. To me, this makes no sense at all - thats a case where I specifically want to debug.

Alas, if the debugger shows an obvious error that could be fixed, python exceptions cannot be resumed. The code listed here will let you see the stack and move around in it, but your program is no longer running and can never finish where it left off. Allowing this requires call/cc, and as far as I can tell there is no plan to ever support that in python.

If this was helpful, or should be changed, let me know. I've just started using it myself.

Closures in Python

2007-08-12T09:37:00.001-04:00

A closure is data attached to code (pretty simple, eh?). I use them for:

Replacing hard coded constants
Eleminating globals
Providing consistent function signatures
Implementing Object Orientation

(Isn't it funny that people rarely tell you what closures are good for?)
Here is a closure in python:

def makeInc(x):
  def inc(y):
     # x is "closed" in the definition of inc
     return y + x

 return inc

inc5 = makeInc(5)
inc10 = makeInc(10)

inc5 (5) # returns 10
inc10(5) # returns 15

Closures in python are created by function calls. Here, the call to makeInc creates a binding for x that is referenced inside the function inc. Each call to makeInc creates a new instance of this function, but each instance has a link to a different binding of x. The example shows the closure of x being used to eliminate either a global or a constant, depending on the nature of x.

import time
keepRunning = True
updates = []
def runLoop():
   while (keepRunning):
       for u in updates:
           u()

class foo:
   def __init__(self, x = 0):
       self.x = x

   def update(self):
       print self.x
       self.x += 1

f = foo()
g = foo(2)

updates.extend([f.update, g.update])

In python, all methods (but not functions) are closures ... sort of. The method definition foo.update closes the class foo. The value of g.update is a closure that stores the value of g and passes that as the first argument of foo.self, hence the first argument of a method in python is self. Details aside, it is important to note that the designers of python have gone out of their way so that you can pass g.update by itself to another function and have it continue to work correctly.

Caveats

In some languages, the variable bindings contained in a closure behave just like any other variables. Alas, in python they are read-only. This is similar to Java, and has the same solution: closing container objects. Closure of a dictionary or array won't let you assign a new dictionary or array, but will let you change the contents of the container. This is a common use pattern - every time you set a variable on self, you are changing the contents of a closed dictionary.

A Visualization of Visualizations

2007-08-03T01:39:00.000-04:00

You really need to see this one first hand, so check it out.

Working with Python/C

2007-07-06T19:13:00.000-04:00

Some very short notes for working with python from C. I'm using Python 2.4.x, YMMV.

Getting the current error state inside your debugger:

_PyThreadState_Current->curexc_type

Instantiating a "New Style" class (a subclass of "object"):

PyObject* args = PyTuple_New(0);
PyObject* dict = PyDict_New();

// works only with new style classes (subclasses of "object")
PyObject* result = PyObject_Call(classObject, args, dict);

Open Network Sockets on Mac OS X

2007-06-15T12:54:00.000-04:00

This is one of those things that constantly annoys me. On Linux, netstat tells me the list of currently active network connections, including (often most importantly) listening connections. Just knowing that something is running on port 8080 tips me off that I probably have an Apache or Java EJB process running (or maybe a swiki) - a trip to localhost:8080 will answer my question. But what if this port isn't an HTTP server, and doesn't speak when connected to? Now you have a dilemma - there's no way of knowing what has this port open.

Fortunately this is a solved problem on Linux. netstat has some options that tell you the PID of the process with the port open. From here you can use ps to find the name of the process, its path, the user who started it, etc. BTW, you need to run netstat as root to see other people's PID's and those of services.

Well, you know what comes next - netstat on the Mac doesn't show PID's! WTF! Speaking of commands that Mac OS X doesn't have, fuser is missing as well. fuser on Linux tells you which processes have a specific file open - very useful if you're cleaning up files and one has is locked. Well, Mac OS X (BSD really) doesn't have fuser... but it does have a command called lsof. lsof isn't quite as user friendly as fuser. It only has one mode, which is to list every open file that is visible to you (this is a subtle hint that you should run it as root to see more files). This means that fuser <filename> roughly translates to lsof | grep <filename>. Very useful for finding that stray service that has outlived its welcome and is holding files hostage.

Still, the problem at hand is finding the PID of network sockets. It turns out that in POSIX, network sockets are pretty much the same as files. This means that they show up in lsof if you ask nicely. And since lsof shows PID's (it even gets fancy and shows the process name), it turns out to be the solution. So, here's the money shot:

sudo lsof -i -P

This produces something that looks like:


COMMAND    PID  USER   FD   TYPE     DEVICE SIZE/OFF   NODE NAME
launchd      1  root    9u  IPv4 0x01d96e10      0t0    UDP *:137
launchd      1  root   10u  IPv4 0x020dbe8c      0t0    TCP *:139 (LISTEN)
launchd      1  root   11u  IPv4 0x020dbb38      0t0    TCP *:445 (LISTEN)
launchd      1  root   12u  IPv6 0x01d99c50      0t0    TCP *:22 (LISTEN)
launchd      1  root   13u  IPv4 0x020db7e4      0t0    TCP *:22 (LISTEN)
mDNSRespo   43  root    7u  IPv4 0x01d96ad0      0t0    UDP *:5353
mDNSRespo   43  root    8u  IPv6 0x01d96a00      0t0    UDP *:5353
mDNSRespo   43  root    9u  IPv4 0x03379d40      0t0    UDP *:5353
mDNSRespo   43  root   13u  IPv4 0x03379860      0t0    UDP 192.168.1.100:53891
mDNSRespo   43  root   14u  IPv4 0x032f82a4      0t0    TCP *:* (CLOSED)
mDNSRespo   43  root   15u  IPv4 0x02edb554      0t0    TCP 192.168.1.100:5000 (LISTEN)
mDNSRespo   43  root   16u  IPv4 0x020da098      0t0    TCP *:* (CLOSED)

... (lots more here)

If you don't have root access, you can still use lsof, but you won't see the plethora of system services and other users processes.

Python Techniques You've Been Googling For

2007-05-29T12:43:00.000-04:00

I've been doing it, I bet you have too. Here's some topics that have come up lately, I'll be writing about each of them in time. Since you probably got here via Google, add a comment for the subject that got you here.

Closures
Debugging
Creating functions
Metaclasses
Performance Optimization
Objects in depth

asdf-install on openMCL

2007-03-17T13:45:00.001-04:00

A Note: This has been sitting in my drafts folder for 10 months. I'm
posting it unedited because its better out then in, right?

---

This should be pretty trivial... after all, its included with the
latest openMCL, right? (I'm using the latest CVS)

Well, I've been fighting to get it working for almost a week, most
likely because I tend to ignore files named README. Having some
experience with CPAN and rubygems, it should really be possible to get
packages from a package repository without even knowing the language
that you are using. Why? Because newbies want to see results before
they have to invest time in learning new things. In the worst case,
an end user doesn't know, or even want to know, how to program at all.
Package repositories should really just work.

Here are my notes so far:

You need to compile asdf-install. To do so, you should be in the
directory containing asdf-install.asd . Then run:
(asdf:operate 'asdf:compile-op :asdf-install)
(asdf:operate 'asdf:load-op :asdf-install)
Now, make your asdf registry:

mkdir /usr/local/share/ccl/asdf-registry
cp asdf-

You need to create ~/openmcl-init.lisp with:
(require 'asdf)
(setf asdf:*central-registry*
'(*default-pathname-defaults*
#p"/usr/local/asdf-install/site-systems/" ; where
asdf-install puts things
#p"/usr/local/share/ccl/asdf-registry/" ; where i put
asdf-install.asd
))
(asdf:operate 'asdf:load-op :asdf-install)

;;; its really best to install GPG
(setf asdf-install:*VERIFY-GPG-SIGNATURES* nil)

The central registry setup is important - maybe
asdf-install/site-systems should be the same folder as the asdf
registry... I dunno. What I do know is that they both need to be in
the asdf search path (ie asdf:*central-registry*) or you won't get
very far.

I hope that I'm misguided, this is all a bad dream, and if I only
followed some magic instructions, this would be trivial. For now,
this works for me, and thats all that really matters.

Old And In The Way (stale fasl's)

2006-12-03T21:37:00.000-05:00

SBCL 1.0 has been released, and as expected, it breaks some things on my Powerbook. One specific problem is stale "fasl" files. Honestly, I don't even know what they are... I suspect them to be some kind of optimized bytecode used by ASDF to improve loading speed, but really (as long as they work) I don't care what they are.

SBCL will stutter something like this:
debugger invoked on a SB-FASL::INVALID-FASL-VERSION: #<SB-SYS:FD-STREAM for "file /usr/local/lib/sbcl/site/rt-20040621/rt.fasl" {11A024F9}> is in native code fasl file format version 70, but this version of SBCL uses format version 71.
Searching cliki revealed that these are intermediate files that can simply be deleted. They suggest that deleting files by hand is tedious and have a nice code snippit that should make things magically refresh in the future... I'm a little more pragmatic:
find /usr/local/lib/sbcl -name '*.fasl' -exec rm {} \;
Works fine afterwards.

Update: I was a little quick to declare victory. After executing this, i was unable to (require 'asdf), since the fasl for asdf had been deleted. I reinstalled (executed sbcl-1.0/install.sh) and the problem is fixed. This seems to invalidate my previous suggestion... Maybe I should have filtered the search to only include files older than a certain date. The cliki code snippit is probably the better way to go for now.