SOLR Logo white bgSome of you read my previous posts The state of open source search. I will in this post go through the process of downloading, installing, configuring and using Apache SOLR to index some sample XML data and search it.

This is the first post in a series, where each new post will explore some new feature. We will simply follow the tutorial to get SOLR up and running locally.

We start by visiting the SOLR Tutorial for the first steps, and simply get the app running:

  1. Check Java version, download SOLR (I chose this file), unpack it, and cd to apache-solr-1.2.0/example
  2. java -jar start.jar
  3. Visit http://localhost:8983/solr/admin/ to see the admin page with a simple test GUI
  4. Try hit the search button, you get the XML response back with 0 results
  5. Now try indexing some content. CD to example/exampledocs
  6. java -jar post.jar solr.xml monitor.xml
  7. Now try searching again and see that you get some hits!

For part two we will get SOLR running in a Tomcat instance, customize the schema a bit and present search results in a custom web page using the client API.

Rana vs Wium Lie

May 2nd, 2008

Note: Links are to Norwegian sites.

Shahzad RanaHåkon Wium LieIn a recnt post on Shahzad Rana’s (Microsoft’s most profiled OOXML promoter in Norway) blog, he comments on Håkon Wium Lie’s (Opera Software’s tech director and profiled standards promoter) wording in a comment to VG TV. Here, Lie introduces the term “Microsoft tax” to explain what happens when ordinary people feel forced to purchase MS-Office to read documents from the government or their kid’s scool. Lie says that the consequence of widespread use of Microsoft Office’s new document format OOXML, could be many more years of vendor lock-in since OOXML allows arbitrary non-standardized, non-open extensions. An example is if a parent recives a document from her kid’s teacher, which contains an Equation binary object, which is not part of the OOXML specification, and thus cannot possibly be implemented by other office packages wanting to support OOXML.

Futher, Rana asks Lie to produce some evidence of an OOXML document from a teacher to a student or parent that is only readable on Windows and MS Office, whereby Lie refers to an OOXML document that Rana himself had sent by email. A funny thing here is that Rana had to rename the .docx file as .doc to be able to upload it to WordPress. This caused a lot of trouble for the users, thus examplifying even stronger what kind of trouble the new format would cause for ordinary people. Rana should of course have zipped the file, or better, modified WordPress to accept .docx files for upload. But a MS supporter is probably not used to the idea of freely being able to modify ones own GPL software :-)

FAST MS Logo

Today, the deal where Microsoft buys FAST, was completed. That means that the Norwegian search engine vendor Fast Search & Transfer is now a fully owned subsidiary of Microsoft.

The FAST ESP product will continue to be offered on all current platforms, and the FAST sales and tech organization continues to operate almost as before, so customers and users will not experience any noise around this transaction.

FAST, when under the MS umbrella, will of course increase focus within the MS Office Sharepoint segment, and will together with MS engineers make an even smoother packaging of the technologies to new and existing customers of high-end Sharepoint sites with large data volumes.

Expect to see continued innovation from FAST in the years to come, and expect also to see a shift towards stronger support for the Windows platform. It is a known fact that the Linux platform has been the most stable up until now for ESP, but now this might shift as Windows versions will get the major focus in QA and patching.

Let us not hope that the Linux, AIX and Solaris versions will be discontinued. I don’t expect that to happen in the short term, as the press release clearly states that they will be supported, and also this blog post by MS’s Kirk Koenigsbauer in the Sharepoint division states that “We’re making a pragmatic decision to continue to delight a core part of FAST’s customer base that has chosen the Linux/UNIX OS. You can bet that we’ll innovate on Windows, too, and over time we hope customers will see .NET as a preferred platform choice. Let’s hope that lasts for many many years to come, so that history can be re-written in this area.

Congratulations, Microsoft, with an excellent new member organization

Congratulations, John Marcus Lervik with the new role of leading MS’s Enterprise Search Business!

See also official press release and FAST’s customer FAQ