Apache Solr is an open-source search platform written in Java and powered by Apache Lucene. Solr is a standalone search server with a REST-like API. We index documents in it via JSON, XML, CSV, or binary over HTTP. We query it via HTTP GET and receive JSON, XML, CSV, or binary results.
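To make the HTTP interface concrete, here is a minimal Python sketch that builds (but does not send) an indexing request and a query URL. It assumes a local Solr instance on the default port 8983 and a collection called "demo"; the collection name and the helper function names are hypothetical, chosen only for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

# Assumptions: a local Solr instance on the default port and a
# hypothetical collection named "demo".
SOLR_BASE = "http://localhost:8983/solr"
COLLECTION = "demo"


def build_index_request(docs):
    """Build an HTTP POST that would index a list of JSON documents."""
    # commit=true asks Solr to make the documents searchable right away.
    url = f"{SOLR_BASE}/{COLLECTION}/update?" + urlencode({"commit": "true"})
    body = json.dumps(docs).encode("utf-8")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_query_url(query, rows=10):
    """Build the HTTP GET URL for a search that returns JSON results."""
    params = {"q": query, "rows": rows, "wt": "json"}
    return f"{SOLR_BASE}/{COLLECTION}/select?" + urlencode(params)
```

Passing the request to urllib.request.urlopen would send it, which only works once a Solr instance with that collection is actually running.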
The following are the features of Apache Solr:
- Advanced Full-Text Search Capabilities
- Optimized for High-Volume Traffic
- Standards-Based Open Interfaces - XML, JSON and HTTP
- Comprehensive Administration Interfaces
- Easy Monitoring
- Highly Scalable and Fault Tolerant
- Flexible and Adaptable with easy configuration
- Near Real-Time Indexing
- Extensible Plugin Architecture
Installing Apache Solr:
As of this writing, the latest stable release of Apache Solr is 6.0.0. Download Apache Solr from http://lucene.apache.org/solr/mirrors-solr-latest-redir.html to install it on your system or server. Once it is downloaded, extract the archive to a location of your choice and cd into that directory.
The bin folder contains the scripts to start and stop the server.
The example folder contains a few example files, which we will use to demonstrate how Solr indexes data.
The server folder contains the logs folder, where all the Solr logs are written. Checking these logs is very helpful when an error occurs during indexing.
The solr folder under server holds the collections or cores that we are going to create. The configuration and data for each collection or core are stored in its own folder.
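As a quick sanity check after extraction, the layout described above can be verified with a short Python sketch. The folder list is taken from the description above; the function name is hypothetical, and some folders (logs in particular) may only appear after the first start, depending on the release.

```python
from pathlib import Path

# Folders named in the description above, relative to the extracted
# Solr directory.
EXPECTED_DIRS = ["bin", "example", "server", "server/logs", "server/solr"]


def missing_solr_dirs(solr_root):
    """Return the expected folders that do not exist under solr_root."""
    root = Path(solr_root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
```

An empty list means every folder was found. Anything else lists what is missing, which usually points to an incomplete extraction or the wrong directory.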
Before we start the Solr instance, we must verify that JAVA_HOME is set on the machine. Apache Solr comes with a built-in Jetty server, so we can start the server using the command-line script. Go to the bin directory from the command prompt and issue the following command:
solr start
This starts the Solr server on the default port, 8983. To run Solr on a different port, specify the port using the following command:
solr start -p <port_number>
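The JAVA_HOME check and the two start commands can also be sketched in Python, for readers who want to script the startup. The function names are hypothetical, and the sketch only builds the command list rather than running it.

```python
import os


def java_home_is_set(env=None):
    """Return True if JAVA_HOME is present and non-empty."""
    env = os.environ if env is None else env
    return bool(env.get("JAVA_HOME", "").strip())


def solr_start_command(script, port=None):
    """Build the argument list for starting Solr.

    script is the path to the start script in the bin folder.
    With port=None, Solr falls back to its default port (8983).
    """
    cmd = [script, "start"]
    if port is not None:
        port = int(port)
        if not 1 <= port <= 65535:
            raise ValueError(f"invalid port: {port}")
        cmd += ["-p", str(port)]
    return cmd
```

The resulting list can be handed to subprocess.run once java_home_is_set() returns True. On Windows the script in the bin folder is solr.cmd, so the path passed as script should point at that file.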
To check whether our Solr instance is running, open the following URL in a browser: