Aug, 2015

2 Comments | Submitted by selvam

Solr 5.1 introduced a revolutionary Streaming API. With Solr 5.2, you get Streaming Expressions on top of it. If you have ever wondered how to run nested queries in Solr or how to use its parallel computing capabilities, this could be the answer.

Streaming Expressions provide a simple query language for SolrCloud that merges search with parallel computing. Under the covers, Streaming Expressions are backed by a Java Streaming API that provides a fast map/reduce implementation for SolrCloud. Streaming Expressions are composed of functions. All functions behave like streams, which means that they don't hold all the data in memory at once. Read more about the basics here: https://cwiki.apache.org/confluence/display/solr/Streaming+Expressions
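As a rough illustration of what such an expression looks like, the sketch below sends a search function to a collection's /stream handler with curl. The collection name (sample), the fields (id, field_i), and the host and port are assumptions made for illustration. The request parameter is assumed to be named stream, as in the 5.x reference guide; check the guide for your version, since the name may differ in other releases.

```shell
# Assumed values: adjust to your own cluster.
SOLR_URL="http://localhost:8983/solr"
COLLECTION="sample"   # hypothetical collection name

# A search() stream; id and field_i are hypothetical fields.
EXPR="search(${COLLECTION}, q=\"*:*\", fl=\"id,field_i\", sort=\"field_i asc\")"

# Send the expression to the /stream handler of the collection.
# The parameter name "stream" is an assumption for Solr 5.x.
run_stream() {
  curl --data-urlencode "stream=${EXPR}" "${SOLR_URL}/${COLLECTION}/stream"
}
```

Calling run_stream once the cluster is up should return the matching tuples one after another rather than as a single in-memory result set.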


This walkthrough assumes a Debian-based system, say Ubuntu 12.04 or 14.04. If you have not installed Solr 5.2, grab the latest release (for example, from http://apache.mirror1.spango.com/lucene/solr/5.2.1/) and extract it.
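The download and extract step could be scripted roughly as follows. This is only a sketch: the archive name solr-5.2.1.tgz is an assumption based on the usual naming of Solr releases, and the mirror above may no longer serve this version, so substitute any mirror that does.

```shell
# Version and mirror taken from the link above; the archive name is assumed.
SOLR_VERSION="5.2.1"
MIRROR="http://apache.mirror1.spango.com/lucene/solr"
ARCHIVE="solr-${SOLR_VERSION}.tgz"

# Download the release, unpack it, and change into the extracted directory.
fetch_solr() {
  wget "${MIRROR}/${SOLR_VERSION}/${ARCHIVE}" &&
  tar xzf "${ARCHIVE}" &&
  cd "solr-${SOLR_VERSION}"
}
```

Run fetch_solr from the directory where you want Solr to live; the bin/solr commands that follow are relative to the extracted directory.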

Set up Solr in cloud mode.

Cloud mode lets you create collections and nodes. See https://cwiki.apache.org/confluence/display/solr/Getting+Started+with+SolrCloud for more details.

bin/solr -e cloud

Enter the port and other details when prompted.

To start a single node, use:

bin/solr start -cloud -s example/cloud/node1/solr -p 8983
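Once the node is up, you will probably want to confirm that it is running and create a collection to experiment with. The sketch below is one way to do that with the bin/solr script. The collection name (sample) and the one-shard layout are assumptions for illustration, not something this setup requires.

```shell
# Assumed values for illustration only.
COLLECTION="sample"   # hypothetical collection name
PORT=8983             # port used when starting the node above

# Report the Solr nodes running on this machine.
check_node() {
  bin/solr status
}

# Create a one-shard, one-replica collection on the running node.
create_collection() {
  bin/solr create -c "${COLLECTION}" -shards 1 -replicationFactor 1 -p "${PORT}"
}
```

Run check_node and then create_collection from the Solr directory.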

Streaming API:

Now comes the interesting part. We have the following streaming API functions, 

Read more →

May, 2012

0 Comments | Submitted by sivaji

A quick tip for sites using the Drupal 7 core search module. It is very common to see Drupal sites relying on the core search module for native site search. As the volume of content grows, the need to offload search to a third-party server is often felt, because core search does not scale well and is not designed to yield optimal results.

By means of contrib modules, Drupal provides a platform to integrate with proven solutions like Sphinx, Lucene, Apache Solr, Google Custom Search Engine, etc.

One important step is often overlooked in this process and regretted later: disabling the node search module. Remember that the core search module is designed so that more than one search module can be active at once. Until the node module is disabled in the search settings (admin/config/search/settings), it remains functional and keeps hurting your site's performance on every cron run.

If you are in the same boat, remember to remove node from the active search modules. Also, do not hesitate to truncate the {search_index} table, because you do not need it if you are not using the node search module.
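One possible way to truncate the table from the command line is through Drush, sketched below. This assumes Drush is installed and run from the Drupal root, and that its sql-query command accepts the --db-prefix option to resolve the braces to the real (possibly prefixed) table name. Neither is part of the original tip, so verify against your Drush version and back up the database first.

```shell
# Query in Drupal's brace notation; --db-prefix (assumed available)
# lets Drush substitute the site's table prefix.
QUERY="TRUNCATE TABLE {search_index}"

# Empty the node search index. Run only after node search is disabled.
truncate_search_index() {
  drush sql-query --db-prefix "${QUERY}"
}
```

Call truncate_search_index after node has been removed from the active search modules, otherwise the next cron run starts rebuilding the index.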

Also, as of Drupal 7.14, there is no way to clear the search index from the Drupal admin interface. Check out issue #326062 if you are interested in supporting this feature.

