<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Cominvent AS - Enterprise search consultants</title>
	<atom:link href="http://www.cominvent.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cominvent.com</link>
	<description>Search, and you will find!</description>
	<lastBuildDate>Wed, 10 Apr 2013 21:05:23 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>FAST ESP support ends this summer</title>
		<link>http://www.cominvent.com/2013/04/10/fast-esp-support-ends-this-summer/</link>
		<comments>http://www.cominvent.com/2013/04/10/fast-esp-support-ends-this-summer/#comments</comments>
		<pubDate>Wed, 10 Apr 2013 20:51:00 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=884</guid>
		<description><![CDATA[FAST&#8217;s good old enterprise search platform, ESP™ has not been upgraded or improved since Microsoft took over the business in 2008. And this summer it is the end of support and hotfixes too. Most ESP users have already taken action &#8230; <a href="http://www.cominvent.com/2013/04/10/fast-esp-support-ends-this-summer/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.cominvent.com/wp-content/uploads/2010/11/FAST-ESP-logo1.png"><img class="alignright size-full wp-image-389" alt="FAST ESP logo" src="http://www.cominvent.com/wp-content/uploads/2010/11/FAST-ESP-logo1.png" width="244" height="48" /></a>FAST&#8217;s good old enterprise search platform, ESP™ has not been upgraded or improved since Microsoft took over the business in 2008. And this summer it is the <a href="http://support.microsoft.com/lifecycle/search/default.aspx?sort=PN&amp;alpha=fast+esp" target="_blank">end of support</a> and hotfixes too.</p>
<p>Most ESP users have already taken action and completed or at least started a migration project. Those who want to stay within the Microsoft world, must move to Sharepoint. For others that is an awkward solution, and for several of these, we have facilitated migrations to Open Source solutions such as Apache Solr™ or ElasticSearch™.</p>
<p>Still not decided what to do? Call us for a chat. We know ESP inside/out and also Lucene based solutions and may guide you through a successful migration whether you like to do 10% or 90% of it yourself. See our <a title="Migrating" href="http://www.cominvent.com/migrating/">migration page</a> for more.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2013/04/10/fast-esp-support-ends-this-summer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solr 4.2.1 released</title>
		<link>http://www.cominvent.com/2013/04/04/solr-4-2-1-released/</link>
		<comments>http://www.cominvent.com/2013/04/04/solr-4-2-1-released/#comments</comments>
		<pubDate>Thu, 04 Apr 2013 09:34:44 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=875</guid>
		<description><![CDATA[Today we released version 4.2.1 of Apache Solr and Lucene. See http://lucene.apache.org/solr/solrnews.html for the release announcement. This is mostly a bugfix release over 4.2, including such things as Collections API fixes, SolrCloud replication fixes and more. If you missed the &#8230; <a href="http://www.cominvent.com/2013/04/04/solr-4-2-1-released/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Today we released version 4.2.1 of Apache Solr and Lucene. See <a href="http://lucene.apache.org/solr/solrnews.html">http://lucene.apache.org/solr/solrnews.html</a> for the release announcement.</p>
<p>This is mostly a bugfix release over 4.2, including such things as Collections API fixes, SolrCloud replication fixes and more.</p>
<p>If you missed the 4.2 release, the major news was that stored fields are now compressed by default, meaning you could get a significantly smaller index footprint if you store a lot of fields.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2013/04/04/solr-4-2-1-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apache Solr™ 4.1 released</title>
		<link>http://www.cominvent.com/2013/01/24/apache-solr-4-1-released/</link>
		<comments>http://www.cominvent.com/2013/01/24/apache-solr-4-1-released/#comments</comments>
		<pubDate>Thu, 24 Jan 2013 15:24:20 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=646</guid>
		<description><![CDATA[Yesterday we released version 4.1 of Apache Solr and Lucene. See http://lucene.apache.org/solr/solrnews.html for the release announcement. This is mostly a bugfix release, including such things as SolrCloud stability fixes, Admin GUI works with Internet Explorer 9 etc. Hereby recommended.]]></description>
				<content:encoded><![CDATA[<p>Yesterday we released version 4.1 of Apache Solr and Lucene. See http://lucene.apache.org/solr/solrnews.html for the release announcement. This is mostly a bugfix release, including such things as SolrCloud stability fixes, Admin GUI works with Internet Explorer 9 etc. Hereby recommended.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2013/01/24/apache-solr-4-1-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apache Solr™ 4.0 released</title>
		<link>http://www.cominvent.com/2012/11/14/apache-solr-4-0-released/</link>
		<comments>http://www.cominvent.com/2012/11/14/apache-solr-4-0-released/#comments</comments>
		<pubDate>Tue, 13 Nov 2012 23:22:03 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Solr]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=636</guid>
		<description><![CDATA[On October 12th we finally released version 4.0 of Apache Solr. See http://lucene.apache.org/solr/solrnews.html for the release announcement. So if you have been reluctant to start using 4.0 beta, now is the time to upgrade. You will enjoy instant improvements in &#8230; <a href="http://www.cominvent.com/2012/11/14/apache-solr-4-0-released/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>On October 12th we finally released version 4.0 of Apache Solr. See <a href="http://lucene.apache.org/solr/solrnews.html">http://lucene.apache.org/solr/solrnews.html</a> for the release announcement. So if you have been reluctant to start using 4.0 beta, now is the time to upgrade. You will enjoy instant improvements in performance and disk/memory footprint just by upgrading, not to mention all the cool new features such as SolrCloud and more. Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/11/14/apache-solr-4-0-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Scaling search with SolrCloud</title>
		<link>http://www.cominvent.com/2012/11/13/scaling-search-with-solrcloud/</link>
		<comments>http://www.cominvent.com/2012/11/13/scaling-search-with-solrcloud/#comments</comments>
		<pubDate>Tue, 13 Nov 2012 17:29:15 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Search technology]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[The Cloud]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=633</guid>
		<description><![CDATA[Jan had a talk at JavaZone in Oslo this september, about scaling search with the new Solr 4.0 with SolrCloud. Here are the slides and the video recording: Scaling search with Apache Solr 4.0 and SolrCloud from JavaZone on Vimeo. &#8230; <a href="http://www.cominvent.com/2012/11/13/scaling-search-with-solrcloud/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Jan had a talk at JavaZone in Oslo this september, about scaling search with the new Solr 4.0 with SolrCloud. Here are the slides and the video recording:</p>
<p><iframe src="http://player.vimeo.com/video/49368443?badge=0" width="500" height="281" frameborder="0" webkitAllowFullScreen mozallowfullscreen allowFullScreen></iframe>
<p><a href="http://vimeo.com/49368443">Scaling search with Apache Solr 4.0 and SolrCloud</a> from <a href="http://vimeo.com/javazone">JavaZone</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
<p><iframe src="http://www.slideshare.net/slideshow/embed_code/14267345" width="427" height="356" frameborder="0" marginwidth="0" marginheight="0" scrolling="no" style="border:1px solid #CCC;border-width:1px 1px 0;margin-bottom:5px" allowfullscreen webkitallowfullscreen mozallowfullscreen> </iframe>
<div style="margin-bottom:5px"> <strong> <a href="http://www.slideshare.net/janhoy/scaling-search-with-solr-cloud" title="Scaling search with Solr Cloud" target="_blank">Scaling search with Solr Cloud</a> </strong> from <strong><a href="http://www.slideshare.net/janhoy" target="_blank">Cominvent AS</a></strong> </div>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/11/13/scaling-search-with-solrcloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lucene/Solr 4.0 Beta is out</title>
		<link>http://www.cominvent.com/2012/08/18/lucenesolr-4-0-beta-is-out/</link>
		<comments>http://www.cominvent.com/2012/08/18/lucenesolr-4-0-beta-is-out/#comments</comments>
		<pubDate>Sat, 18 Aug 2012 12:45:25 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=624</guid>
		<description><![CDATA[This week the ß version of Apache Lucene/Solr was released, see release announcement. This means that things are getting stable and approaching the FINAL release &#8211; after a looong time in the makings. The most important changes is perhaps better &#8230; <a href="http://www.cominvent.com/2012/08/18/lucenesolr-4-0-beta-is-out/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><img class="alignright" title="Beta" src="http://www.asliceofleadership.com/wp-content/uploads/2012/02/Leadership-Beta.gif" alt="" width="300" height="298" />This week the ß version of Apache Lucene/Solr was released, see <a href="http://www.gossamer-threads.com/lists/lucene/general/164266?page=last" target="_blank">release announcement</a>. This means that things are getting stable and approaching the FINAL release &#8211; after a looong time in the makings.</p>
<p>The most important changes is perhaps better maturity for <a href="http://wiki.apache.org/solr/SolrCloud/" target="_blank">SolrCloud</a> along with a lot of bug fixes. If you need the ease of scalability that SolrCloud brings, this beta is a safe version to get started with &#8211; even in production. We (the developers) promise that we&#8217;ll not break backwards compat unless absolutely necessary due to a bug or something.</p>
<p>Some of the contributions from Cominvent include <a href="https://issues.apache.org/jira/browse/SOLR-3377">SOLR-3377</a>, <a href="https://issues.apache.org/jira/browse/SOLR-1856">SOLR-1856</a>, <a href="https://issues.apache.org/jira/browse/SOLR-1929">SOLR-1929</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3672">SOLR-3672</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3467">SOLR-3467</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3660">SOLR-3660</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3648">SOLR-3648</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3439">SOLR-3439</a>, <a href="https://issues.apache.org/jira/browse/SOLR-2802">SOLR-2802</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3255">SOLR-3255</a>, <a href="https://issues.apache.org/jira/browse/SOLR-3351">SOLR-3351</a> and more.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/08/18/lucenesolr-4-0-beta-is-out/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Amazon releases CloudSearch</title>
		<link>http://www.cominvent.com/2012/04/17/amazon-releases-cloudsearch/</link>
		<comments>http://www.cominvent.com/2012/04/17/amazon-releases-cloudsearch/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 15:01:30 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Solr]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[The Cloud]]></category>
		<category><![CDATA[Trends]]></category>
		<category><![CDATA[CloudSearch]]></category>
		<category><![CDATA[Elastic Search]]></category>
		<category><![CDATA[Solr 4.0]]></category>
		<category><![CDATA[SolrCloud]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=612</guid>
		<description><![CDATA[Amazon has really impressed me with their efficient and simple cloud solutions. Last week they announced a hosted search solution in their cloud, named CloudSearch. The idea is the same as for all the other cloud services, namely that they &#8230; <a href="http://www.cominvent.com/2012/04/17/amazon-releases-cloudsearch/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><img class="size-medium wp-image-613 alignleft" title="amazon cloud search" src="http://www.cominvent.com/wp-content/uploads/2012/04/amazon-cloud-search-300x183.jpg" alt="" width="300" height="183" />Amazon has really impressed me with their efficient and simple cloud solutions. <a href="http://aws.amazon.com/about-aws/whats-new/2012/04/11/aws-announces-cloudsearch/">Last week they announced a hosted search solution in their cloud</a>, named CloudSearch. The idea is the same as for all the other cloud services, namely that <strong>they</strong> take care of all the hosting, provisioning and scaling (CloudSearch can auto-scale) while <strong>you</strong> concentrate on indexing and searching.<span id="more-612"></span></p>
<p>So what do you get and how does this compare to Solr? Well, the engine powering CloudSearch is Amazon&#8217;s own <a href="http://en.wikipedia.org/wiki/A9.com">A9 technology</a>. It features many of the same basic features as Lucene and Solr but there are also differences. You get plain field search, boolean search, range search, facets, stemming, stopwords, synonyms and a relevancy search. The relevancy search can be tuned using a JavaScript like syntax in which you tell it what to boost. The concept reminds me of Solr&#8217;s FunctionQueries which lets you &#8220;program&#8221; your own boosts.</p>
<p>If you want to run Solr but be able to scale it into the cloud, CloudSearch won&#8217;t do that for you. Then you should consider using SolrCloud on Amazon EC2 or another cloud platform instead. There are even providers of hosted plans out there such as <a href="http://www.indexdepot.com/">IndexDepot</a> and <a href="http://www.websolr.com/">WebSolr</a> which take care of the hosting part for you.</p>
<p>I look forward to playing with the service the next time I find myself wondering what to do <img src='http://www.cominvent.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/04/17/amazon-releases-cloudsearch/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solr 3.6 released</title>
		<link>http://www.cominvent.com/2012/04/17/solr-3-6-released/</link>
		<comments>http://www.cominvent.com/2012/04/17/solr-3-6-released/#comments</comments>
		<pubDate>Tue, 17 Apr 2012 14:26:23 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Solr]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[CurrencyField]]></category>
		<category><![CDATA[Solr 3.6]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=603</guid>
		<description><![CDATA[Version 3.6 of Apache Solr was recently released. This will be the last 3.x release before 4.0 will come out later this year. Version 3.6 contains many improvements and bug fixes. Here&#8217;s the release statement: 12 April 2012, Apache Solr™ &#8230; <a href="http://www.cominvent.com/2012/04/17/solr-3-6-released/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Version 3.6 of Apache Solr was recently released. This will be the last 3.x release before 4.0 will come out later this year. Version 3.6 contains many improvements and bug fixes.</p>
<p>Here&#8217;s the release statement:<span id="more-603"></span></p>
<table width="100%">
<tbody>
<tr bgcolor="#CCCCCC">
<td>12 April 2012, Apache Solr™ 3.6.0 available</p>
<p>The Lucene PMC is pleased to announce the release of Apache Solr 3.6.0.</p>
<p>Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database<br />
integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world&#8217;s<br />
largest internet sites.</p>
<p>This release contains numerous bug fixes, optimizations, and improvements, some of which are highlighted below. The release is available for immediate download at: <a href="http://lucene.apache.org/solr/mirrors-solr-latest-redir.html">http://lucene.apache.org/solr/mirrors-solr-latest-redir.html</a> (see note below).</p>
<p>See the <a href="http://svn.apache.org/viewvc/lucene/dev/branches/lucene_solr_3_6/solr/CHANGES.txt?view=co">CHANGES.txt</a> file included with the release for a full list of<br />
details.</p>
<p>Solr 3.6.0 Release Highlights:</p>
<ul>
<li>New SolrJ client connector using Apache Http Components http client (<a href="http://issues.apache.org/jira/browse/SOLR-2020" target="_new">SOLR-2020</a>)</li>
<li>Many analyzer factories are now &#8220;multi term query aware&#8221; allowing for things like field type aware lowercasing when building prefix &amp; wildcard queries (<a href="http://issues.apache.org/jira/browse/SOLR-2438" target="_new">SOLR-2438</a>)</li>
<li>New Kuromoji morphological analyzer tokenizes Japanese text, producing both compound words and their segmentation. (<a href="http://issues.apache.org/jira/browse/SOLR-3056" target="_new">SOLR-3056</a>)</li>
<li>Range Faceting (Dates &amp; Numbers) is now supported in distributed search (<a href="http://issues.apache.org/jira/browse/SOLR-1709" target="_new">SOLR-1709</a>)</li>
<li>HTMLStripCharFilter has been completely re-implemented, fixing many bugs and greatly improving the performance (<a href="http://issues.apache.org/jira/browse/LUCENE-3690">LUCENE-3690</a>)</li>
<li>StreamingUpdateSolrServer now supports the javabin format (<a href="http://issues.apache.org/jira/browse/SOLR-1565" target="_new">SOLR-1565</a>)</li>
<li>New LFU Cache option for use in Solr&#8217;s internal caches. (<a href="http://issues.apache.org/jira/browse/SOLR-2906" target="_new">SOLR-2906</a>)</li>
<li>Memory performance improvements to all FST based suggesters (<a href="http://issues.apache.org/jira/browse/SOLR-2888" target="_new">SOLR-2888</a>)</li>
<li>New WFSTLookupFactory suggester supports finer-grained ranking for suggestions. (<a href="http://issues.apache.org/jira/browse/LUCENE-3714">LUCENE-3714</a>)</li>
<li>New options for configuring the amount of concurrency used in distributed searches (<a href="http://issues.apache.org/jira/browse/SOLR-3221" target="_new">SOLR-3221</a>)</li>
<li>Many bug fixes</li>
</ul>
<p>Note: The Apache Software Foundation uses an extensive mirroring network for<br />
distributing releases. It is possible that the mirror you are using may not<br />
have replicated the release yet. If that is the case, please try another<br />
mirror. This also goes for Maven access.</p>
<p>Happy searching,</p>
<p>Lucene/Solr developers</td>
</tr>
</tbody>
</table>
<p>In addition to these news, Jan Høydahl has also contributed these features and fixes to 3.6:</p>
<ul>
<li><a href="http://issues.apache.org/jira/browse/SOLR-1052" target="_new">SOLR-1052</a>: Deprecate/Remove and in favor of in solrconfig.xml</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2202" target="_new">SOLR-2202</a>: Money/Currency FieldType (Handling multiple currencies in the same field See <a href="http://wiki.apache.org/solr/CurrencyField">http://wiki.apache.org/solr/CurrencyField</a>)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2970" target="_new">SOLR-2970</a>: CSV ResponseWriter returns fields defined as stored=false in schema</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-3026" target="_new">SOLR-3026</a>: eDismax: Locking down which fields can be explicitly queried (<a href="http://wiki.apache.org/solr/ExtendedDisMax#uf_.28User_Fields.29">user fields aka uf</a> - control what fields users may query)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2764" target="_new">SOLR-2764</a>: Create a <a href="http://wiki.apache.org/solr/LanguageAnalysis#Norwegian">NorwegianLightStemmer</a> and NorwegianMinimalStemmer &#8211; simpler and sometimes better than the Snowball</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2826" target="_new">SOLR-2826</a>: URLClassify Update Processor (good for web search)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-3140" target="_new">SOLR-3140</a>: Make omitNorms default for all numeric field types (simplifies schema, better default)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2901" target="_new">SOLR-2901</a>: Upgrade Solr to Tika 1.0</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-3009" target="_new">SOLR-3009</a>: hitGrouped.vm isn&#8217;t shipped with 3.x (fixes grouping in the /browse GUI)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-3059" target="_new">SOLR-3059</a>: Provide example for XSLTUpdateRequest (allows you to POST a Solr search result into another Solr index as XML)</li>
<li><a href="http://issues.apache.org/jira/browse/SOLR-2487" target="_new">SOLR-2487</a>: New Ant build target to not include slf4j-jdk14 jar in WAR (nice if you want to use another log framework than JUL)</li>
</ul>
<p>This version is backwards compatible with the other 3.x versions, so upgrading is as simple as replacing the WAR file.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/04/17/solr-3-6-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Super flexible AutoComplete with Solr</title>
		<link>http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/</link>
		<comments>http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/#comments</comments>
		<pubDate>Wed, 25 Jan 2012 17:34:17 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Solr]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=576</guid>
		<description><![CDATA[AutoComplete or AutoSuggest has in recent years become a &#8220;must-have&#8221; search feature. Solr can do AutoComplete in a number of ways (such as Suggester, TermsComponent and Faceting using facet.prefix), but in this post we&#8217;ll consider a more advanced and flexible option, namely querying a &#8230; <a href="http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Autocomplete#In_search_engines" target="_blank">AutoComplete</a> or AutoSuggest has in recent years become a &#8220;must-have&#8221; search feature. Solr can do AutoComplete in a number of ways (such as <a href="http://wiki.apache.org/solr/Suggester" target="_blank">Suggester</a>, <a href="http://wiki.apache.org/solr/TermsComponent" target="_blank">TermsComponent</a> and Faceting using <a href="http://wiki.apache.org/solr/SimpleFacetParameters#facet.prefix" target="_blank">facet.prefix</a>), but in this post we&#8217;ll consider a more advanced and flexible option, namely querying a dedicated Solr Core search index for the suggestions. You may think that this sounds heavy weight, but we&#8217;re talking small data here so it is really efficient and snappy!</p>
<p><a href="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-02-16-at-16.33.50.png"><img class="alignnone size-full wp-image-600" title="Screen Shot 2012-02-16 at 16.33.50" src="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-02-16-at-16.33.50.png" alt="" width="420" height="257" /></a><br clear="all" /> Even if it&#8217;s some work setting up, the benefits to this approach are really compelling:<span id="more-576"></span></p>
<ol>
<li>Suggest on multi word or sentences</li>
<li>Suggests on prefix of whole line and/or individual words</li>
<li>Full relevancy tuning capabilities of Solr (in contrast to a single frequency sorting from TermsComponent or Faceting)</li>
<li>Increased recall using Phonetics, Fuzzy, Character normalization, or any other Solr feature</li>
<li>Rich suggestions including thumbnails, extra texts etc</li>
<li>Easy to mix different &#8220;categories&#8221; of suggestsions, e.g. &#8220;Book titles&#8221;, &#8220;Authors&#8221;, &#8220;Genres&#8221;, and group these together</li>
<li>Much more</li>
</ol>
<p>So let&#8217;s get to it. To make it easy, we&#8217;ve shared with you a ready to run example setup at <a href="https://github.com/cominvent/autocomplete" target="_blank">https://github.com/cominvent/autocomplete</a> which provides a template for you to build upon for your own needs. First we&#8217;ll guide you through downloading and running the example, indexing some example data with names of all countries and all major cities in the world. Then we&#8217;ll look more behind the scenes to explain how it&#8217;s all setup.</p>
<h2>Running the example</h2>
<p>The example contains a complete solr-home configured with an &#8220;ac&#8221; core, and some example data in CSV format which we&#8217;ll feed to our core using HTTP POST (curl). Solr itself is not included in the example, so you&#8217;ll need to download <a href="http://www.apache.org/dyn/closer.cgi">Solr 3.5</a>or newer first. We&#8217;ll then start the Solr example app, pointing to our autocomplete solr-home.</p>
<ol>
<li>Download and unpack Solr3.5 if you have not already <a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/" target="_blank">http://www.apache.org/dyn/closer.cgi/lucene/solr/</a></li>
<li>Download and unpack autocomplete example from GitHub: <a href="https://github.com/cominvent/autocomplete/zipball/master" target="_blank">https://github.com/cominvent/autocomplete/zipball/master</a> (alternatively check out the code using git)</li>
<li>Cd to the autocomplete folder, open README.TXT and follow the instructions. When done you will have Solr up and running with the example data indexed into the &#8220;ac&#8221; core.</li>
<li>When you browse to http://localhost:8983/solr/ac/browse and start typing, you&#8217;ll see countries and cities suggested</li>
</ol>
<div>The population of countries and cities is used as the major ranking factor. But textual relevancy is also used, so that complete matches score higher than partial matches and more.</div>
<h2>Behind the scenes</h2>
<div>So let&#8217;s dive down in the details. The main layers in our example are (see solr-home/ac/conf):</div>
<div>
<ol>
<li>JavaScript hook (in velocity/head.vm) which performs a query to Solr whenever you type a character in the search box</li>
<li>A velocity template (velocity/suggest.vm), adapted from Solr&#8217;s velocity contrib, which reformats the Solr response the way we want it</li>
<li>jQuery AutoComplete plugin which renders the results output from suggest.vm</li>
<li>The Solr core configuration itself as defined by schema.xml and solrconfig.xml</li>
<li>The example data which is indexed into the &#8220;ac&#8221; core</li>
</ol>
</div>
<div>In your own application you will want to tailor all of these. If your site is PHP driven you&#8217;ll create a PHP page instead of our our Velocity templates etc.</div>
<h3>1. Querying Solr from the page</h3>
<p>This Javascript makes prepares the URL to query. Here we ask Solr directly, but normally you&#8217;ll want to call some (PHP/JSP/ASPX) servlet of yours which in turn queries the private Solr server. So put the URL to your servlet. Your servlet will then call the autocomplete core similar to what we do here. We call the normal &#8220;/select&#8221; handler, but with qt=ac we select the handler we want, and instead of XML back we ask for a velocity template called &#8220;suggest&#8221; (&amp;wt=velocity&amp;v.template=suggest), which formats one suggestion per line. Finally we ask for 20 results.</p>
<pre><a href="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-01-25-at-17.37.01.png"><img class="alignnone size-full wp-image-589" title="Screen Shot 2012-01-25 at 17.37.01" src="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-01-25-at-17.37.01.png" alt="" width="562" height="231" /></a></pre>
<h3>2. Format the response to suit your Ajax component</h3>
<p>In our example we use <a href="https://github.com/cominvent/autocomplete/blob/master/solr-home/ac/conf/velocity/suggest.vm" target="_blank">suggest.vm</a> to format the results to make it suitable for our AutoComplete component. You would probably want to do this formatting in your client code instead. Our simple jQuery plugin expects one suggestion text per line. A more advanced Ajax component may be able to take in multiple pieces of information, such as thumbnail URLs etc.</p>
<h3>3. The Autocomplete component</h3>
<p>We use jQuery autocomplete component, and it eats the response from suggest.vm one line at a time and displays it in a nice (ehem) dropdown list.</p>
<h3>4. Solr schema and config</h3>
<p>Now the juice of our solution lies in the AC core itself. It has a <a href="https://github.com/cominvent/autocomplete/blob/master/solr-home/ac/conf/schema.xml" target="_blank">schema</a> which matches the input data, and this should be a good starting point for you too. Apart from the fields we return, we also have some special fields which are only searched. These are <strong>textnge</strong>, <strong>textng</strong>, <strong>textphon</strong>, <strong>extrasearch</strong> and <strong>phonetic</strong>. They are copied from <strong>textsuggest</strong>, but have different processing, which can be seen from the fieldTypes:</p>
<p style="padding-left: 30px;">textnge uses fieldType autocomplete_edge which will match only from the left edge of the suggestion text. For this we use KeywordTokenizerFactory and EdgeNGramFilterFactory along with some regex cleansing.</p>
<p style="padding-left: 30px;">textng uses fieldType autocomplete_ngram which matches from the start of every word, so that you can get right-truncated suggestions for any word in the text, not only from the first word. The main difference from textnge is the tokenizer which is StandardTokenizerFactory, thus the EdgeNGram filter will produce N-grams for every single token.</p>
<p style="padding-left: 30px;">textphon and phonetic uses fieldType text_phonetic_do which applies the DoubleMetaphoneFilterFactory phonetic filter. This lets a query for &#8220;muhammed&#8221; also match &#8220;mohammad&#8221; etc. Note that it does not make sense to combine N-gram and phonetics, so we only compute phonetic normalization per whole word. The difference between the two fields is how they get populated &#8211; textphon gets populated with a copyField from textsuggest, while &#8220;phonetic&#8221; must be explicitly fed in the content. This is to let you choose whether phonetics should apply to all suggestions or just some.</p>
<p style="padding-left: 30px;">Finally, the field extrasearch is simply queried in addition to textsuggest, and is a way to match hidden text that is not displayed. In our example we add country codes to this field so you can get a match for &#8220;Sweden&#8221; by typing &#8220;SE&#8221;. This field could also contain your hand-edited list of synonyms.</p>
<h3><strong>solrconfig.xml</strong></h3>
<p><strong></strong>In our <a href="https://github.com/cominvent/autocomplete/blob/master/solr-home/ac/conf/solrconfig.xml" target="_blank">solrconfig</a> we have setup two request handler configs, one for our &#8220;/browse&#8221; interface, for testing, and one &#8220;ac&#8221; without the Velocity stuff. In real life we would not have a &#8220;/browse&#8221; section because your main search index would reside in another core and probably contain other data than the autoComplete data. In our search GUI, if you click the search button you will get a normal search result directly from the &#8220;ac&#8221; core, so don&#8217;t be confused by that <img src='http://www.cominvent.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' />  Let&#8217;s look at the config: <a href="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-01-25-at-18.09.21.png"><img class="alignnone size-full wp-image-591" title="Screen Shot 2012-01-25 at 18.09.21" src="http://www.cominvent.com/wp-content/uploads/2012/01/Screen-Shot-2012-01-25-at-18.09.21.png" alt="" width="598" height="551" /></a></p>
<p>We use edismax query parser, and set it up with &#8220;qf=textsuggest^30 extrasearch^30.0 textng^50.0 phonetic^10&#8243; and pf=textnge^50.0. So the most important field is textng, and we weigh down the phonetic results to avoid noise. If you want to always search phonetic, add textphon to qf. The textnge field is searched through pf parameter (PhraseField), to give a boost to exact suggestions matching from beginning of query.</p>
<p>In addition to textual relevance we boost (bf) by popularity (in our case population), and we also do a multiplicative boost (boost) formula to be able to say that countries type entries should be more frequently suggested than the ones with type=Cities. This lets you tune the balance between the various types in a dynamic way without reindexing. You can even change a weight with a URL parameter instead of editing solrconfig.xml.</p>
<h3>5. The example data</h3>
<p>The example data file <a href="https://github.com/cominvent/autocomplete/blob/master/exampledata/ac-example.csv" target="_blank">ac-example.csv</a> is a comma separated file (which you may open in your spreadsheet application if you wish to view it more readable). Its columns match our schema, and you can easily use it as a template for entering you own autosuggest data. To feed the file to Solr you may use any HTTP client you choose, like curl as being done by feed-ac.sh referenced in <a href="https://github.com/cominvent/autocomplete/blob/master/README.TXT" target="_blank">README.TXT</a>. This concludes the walkthrough. Comments welcome!</p>
<h2>References</h2>
<p>[1]: Lucid&#8217;s <a href="http://www.lucidimagination.com/blog/2009/09/08/auto-suggest-from-popular-queries-using-edgengrams/">blog post</a> about AutoComplete using edgeNgram</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2012/01/25/super-flexible-autocomplete-with-solr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solr 3.5 released</title>
		<link>http://www.cominvent.com/2011/11/27/solr-3-5-released/</link>
		<comments>http://www.cominvent.com/2011/11/27/solr-3-5-released/#comments</comments>
		<pubDate>Sat, 26 Nov 2011 23:32:15 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Search technology]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Solr 3.5]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=569</guid>
		<description><![CDATA[Today a new version of Apache Solr was released, version 3.5.0. Here&#8217;s the release statement from the Lucene PMC: The Lucene PMC is pleased to announce the release of Apache Solr 3.5.0! See the CHANGES.txt file included with the release for &#8230; <a href="http://www.cominvent.com/2011/11/27/solr-3-5-released/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Today a new version of Apache Solr was released, version 3.5.0. Here&#8217;s the release statement from the Lucene PMC:</p>
<table width="80%">
<tbody>
<tr bgcolor="#CCCCCC">
<td>The Lucene PMC is pleased to announce the release of <a href="http://www.apache.org/dyn/closer.cgi/lucene/solr">Apache Solr 3.5.0</a>!</p>
<p>See the <a href="http://svn.apache.org/viewvc/lucene/dev/branches/lucene_solr_3_5/solr/CHANGES.txt?view=co">CHANGES.txt</a> file included with the release for a full list of details.</p>
<p>Solr 3.5.0 Release Highlights:</p>
<ul>
<li>Bug fixes and improvements from Apache Lucene 3.5.0, including a very substantial (3-5X) RAM reduction required to hold the terms index on opening an IndexReader. (<a href="https://issues.apache.org/jira/browse/LUCENE-2205">LUCENE-2205</a>)</li>
<li>Added support for distributed result grouping. (<a href="https://issues.apache.org/jira/browse/SOLR-2066">SOLR-2066</a>, <a href="https://issues.apache.org/jira/browse/SOLR-2776">SOLR-2776</a>)</li>
<li>Added support for Hunspell stemmer TokenFilter supporting stemming for 99 languages. (<a href="https://issues.apache.org/jira/browse/SOLR-2769">SOLR-2769</a>)</li>
<li>A new contrib module &#8220;langid&#8221; adds language identification capabilities as an Update Processor, using Tika&#8217;s LanguageIdentifier or Cybozu language-detection library (<a href="https://issues.apache.org/jira/browse/SOLR-1979">SOLR-1979</a>)</li>
<li>Numeric types including Trie and date types now support sortMissingFirst/Last. (<a href="https://issues.apache.org/jira/browse/SOLR-2881">SOLR-2881</a>)</li>
<li>Added hl.q parameter. It is optional and if it is specified, it overrides q parameter in Highlighter. (<a href="https://issues.apache.org/jira/browse/SOLR-1926">SOLR-1926</a>)</li>
<li>Several minor bugfixes like date parsing for years from 0001-1000, ignored configurations when using QueryAnalyzer with SpellCheckComponent and many more. See CHANGES.txt entries for full details.</li>
</ul>
</td>
</tr>
</tbody>
</table>
<p>Contributions from Cominvent include LanguageIdentifier, Plugging in Hunspell stemmer in Solr and <a href="https://issues.apache.org/jira/browse/SOLR-2742" target="_blank">SOLR-2742</a> which makes commitWithin more accessible through the SolrJ APIs. Also, Apache Tika is upgraded to version 0.10, fixing several bugs in parsing PDFs and Office documents.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2011/11/27/solr-3-5-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
