<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
		xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	xmlns:media="http://search.yahoo.com/mrss/"
>

<channel>
	<title>Cominvent AS - Enterprise search consultants &#187; tika</title>
	<atom:link href="http://www.cominvent.com/tag/tika/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.cominvent.com</link>
	<description>Search, and you will find!</description>
	<lastBuildDate>Thu, 26 Jan 2012 12:33:47 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<copyright>2006-2007 </copyright>
	<managingEditor>ci@cominvent.com (Cominvent AS - Enterprise search consultants)</managingEditor>
	<webMaster>ci@cominvent.com (Cominvent AS - Enterprise search consultants)</webMaster>
	<image>
		<url>http://www.cominvent.com/wp-content/plugins/podpress/images/powered_by_podpress.jpg</url>
		<title>Cominvent AS - Enterprise search consultants</title>
		<link>http://www.cominvent.com</link>
		<width>144</width>
		<height>144</height>
	</image>
	<itunes:subtitle></itunes:subtitle>
	<itunes:summary>Search, and you will find!</itunes:summary>
	<itunes:keywords></itunes:keywords>
	<itunes:category text="Society &#38; Culture" />
	<itunes:author>Cominvent AS - Enterprise search consultants</itunes:author>
	<itunes:owner>
		<itunes:name>Cominvent AS - Enterprise search consultants</itunes:name>
		<itunes:email>ci@cominvent.com</itunes:email>
	</itunes:owner>
	<itunes:block>no</itunes:block>
	<itunes:explicit>no</itunes:explicit>
	<itunes:image href="http://www.cominvent.com/wp-content/plugins/podpress/images/powered_by_podpress_large.jpg" />
		<item>
		<title>Apache Solr 1.4 finally released</title>
		<link>http://www.cominvent.com/2009/11/11/apache-solr-1-4-finally-released/</link>
		<comments>http://www.cominvent.com/2009/11/11/apache-solr-1-4-finally-released/#comments</comments>
		<pubDate>Wed, 11 Nov 2009 12:30:42 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Search technology]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[enterprise search]]></category>
		<category><![CDATA[excel]]></category>
		<category><![CDATA[faceting]]></category>
		<category><![CDATA[lucene]]></category>
		<category><![CDATA[nested queries]]></category>
		<category><![CDATA[ODF]]></category>
		<category><![CDATA[office]]></category>
		<category><![CDATA[pdf]]></category>
		<category><![CDATA[range queries]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[solr 1.4]]></category>
		<category><![CDATA[tika]]></category>
		<category><![CDATA[word]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=197</guid>
		<description><![CDATA[Finally, after long delays, Apache Solr version 1.4 is released. The long delay was mainly due to very strict quality standards, which made it necessary to wait until some serious known bugs were dealt with. Also read my previous article on what&#8217;s new in this version. Here is the official release statement: Apache Solr 1.4 [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-120" title="apache_solr_logo" src="http://www.cominvent.com/wp-content/uploads/2009/06/apache_solr_logo.gif" alt="apache_solr_logo" width="170" height="94" />Finally, after long delays, Apache Solr version 1.4 is released. The long delay was mainly due to very strict quality standards, which made it necessary to wait until some serious known bugs were dealt with. Also read my <a href="http://www.cominvent.com/2009/06/22/solr-1-4-with-nice-improvements-solr-1-4-med-kjekke-forbedringer/">previous article</a> on what&#8217;s new in this version. Here is the official release statement:</p>
<pre><span style="font-family: courier,monospace;">Apache Solr 1.4 has been released and is now available for
public download! <a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/" target="_blank">http://www.apache.org/dyn/closer.cgi/lucene/solr/</a>

<span id="more-197"></span>Solr is the popular, blazing fast open source enterprise search
platform from the Apache Lucene project. Its major features include
powerful full-text search, hit highlighting, faceted search,
dynamic clustering, database integration, and rich document (e.g.,
Word, PDF) handling. Solr is highly scalable, providing distributed
search and index replication, and it powers the search and
navigation features of many of the world’s largest internet sites.

Solr is written in Java and runs as a standalone full-text search
server within a servlet container such as Tomcat. Solr uses the
Lucene Java search library at its core for full-text indexing and
search, and has REST-like HTTP/XML and JSON APIs that make it easy
to use from virtually any programming language. Solr’s powerful
external configuration allows it to be tailored to almost any type
of application without Java coding, and it has an extensive plugin
architecture when more advanced customization is required.

New Solr 1.4 features include
- Major performance enhancements in indexing, searching, and
  faceting
- Revamped all-Java index replication that’s simple to configure
  and can replicate config files
- Greatly improved database integration via the DataImportHandler
- Rich document processing (Word, PDF, HTML) via Apache Tika
- Dynamic search results clustering via Carrot2
- Multi-select faceting (support for multiple items in a single
  category to be selected)
- Many powerful query enhancements, including ranges over arbitrary
  functions, and nested queries of different syntaxes
- Many other plugins including Terms for auto-suggest, Statistics,
  TermVectors, Deduplication

Getting Started
---------------
New to Solr? Follow the steps below to get up and running ASAP.

1. Download Solr at <a href="http://www.apache.org/dyn/closer.cgi/lucene/solr/" target="_blank">http://www.apache.org/dyn/closer.cgi/lucene/solr/</a>
2. Check out the tutorial at <a href="http://lucene.apache.org/solr/tutorial.html" target="_blank">http://lucene.apache.org/solr/tutorial.html</a>
3. Read the Solr wiki (<a href="http://wiki.apache.org/solr" target="_blank">http://wiki.apache.org/solr</a>) to learn more
4. Join the community by subscribing to solr-user@lucene.apache.org
5. Give Back (Optional, but encouraged!)
   See <a href="http://wiki.apache.org/solr/HowToContribute" target="_blank">http://wiki.apache.org/solr/HowToContribute</a>

For more information on Apache Solr, see <a href="http://lucene.apache.org/solr" target="_blank">http://lucene.apache.org/solr</a>
</span></pre>
<p>Release-declaration sourced from <a href="http://lucene.grantingersoll.com/2009/11/10/apache-solr-1-4-0-offically-released/" target="_blank">Grant Ingersoll&#8217;s blog</a>.</p>
<p>Cominvent delivers <a href="http://www.solrtraining.com/">training</a> on Solr 1.4.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2009/11/11/apache-solr-1-4-finally-released/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Solr 1.4 with nice improvements</title>
		<link>http://www.cominvent.com/2009/06/22/solr-1-4-with-nice-improvements-solr-1-4-med-kjekke-forbedringer/</link>
		<comments>http://www.cominvent.com/2009/06/22/solr-1-4-with-nice-improvements-solr-1-4-med-kjekke-forbedringer/#comments</comments>
		<pubDate>Mon, 22 Jun 2009 22:04:48 +0000</pubDate>
		<dc:creator>janhoy</dc:creator>
				<category><![CDATA[Open Source]]></category>
		<category><![CDATA[Search technology]]></category>
		<category><![CDATA[Solr]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[performance]]></category>
		<category><![CDATA[replication]]></category>
		<category><![CDATA[scaling]]></category>
		<category><![CDATA[solr 1.4]]></category>
		<category><![CDATA[tika]]></category>

		<guid isPermaLink="false">http://www.cominvent.com/?p=147</guid>
		<description><![CDATA[The Apache Solr 1.3 search server is very capable and stable, so there is nothing that should keep you from deploying it just because a new version is around the corner (within a few months that is). Upgrades should be smooth as well. So what are some of the improvements in 1.4? Apache Tika integration [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignright size-full wp-image-120" title="apache_solr_logo" src="http://www.cominvent.com/wp-content/uploads/2009/06/apache_solr_logo.gif" alt="apache_solr_logo" />The Apache Solr 1.3 search server is very capable and stable, so there is nothing that should keep you from deploying it just because a new version is around the corner (within a few months that is). Upgrades should be smooth as well.</p>
<p>So what are some of the improvements in 1.4?<span id="more-147"></span></p>
<h2>Apache Tika integration</h2>
<p>One of the more useful news for those doing intranet search is the integration with Tika (or Solr Cell), which is a document parsing component which can read MS Office formats, PDF and more.</p>
<h2>Improved performance</h2>
<p>You will get improved faceting speed for free, and with a small change in the schema, you will also benefit from greatly improved integer (and float, date) range searches. This is due to a smarter internal implementation, so instead of expanding an integer range into an OR of ALL discrete string values in the range. Say you want to search the range 0-1234. In v1.3 that would expand into an OR of (0, 1, 2, 3, 4&#8230;1229, 1230, 1231, 1232, 1233, 1234). For simplicity lets pretend the implementation split along 10-decimalsthe range prefixed th for thousand, hu for hundred, te for ten, and substitute &#8220;full&#8221; ranges, then the new OR will be (th0, hu0, hu1, te0, te1, te2, 1230, 1231, 1232, 1233, 1234), thereby reducing number of terms from thousands to tens.</p>
<h2>Easier index replication</h2>
<p>Replicating your index for large systems has not been for the amateur so far, needing to setup all nodes manually (no installer support) as well as scripts for rsyncing index to slaves. In 1.4 the default index replication is by Java process-to-process, so it is easier to setup, and will also work on Windows!</p>
<h2>And more</h2>
<p>There are also more news, such as Lucene improvements which Solr also benefits from. Read more <a href="http://issues.apache.org/jira/browse/SOLR/fixforversion/12313351" target="_blank">here</a>.</p>
<p>So look out for Solr 1.4 on a website near you&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.cominvent.com/2009/06/22/solr-1-4-with-nice-improvements-solr-1-4-med-kjekke-forbedringer/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

