Apache Solr Ref Guide 4.10

apache-solr-ref-guide-4.10

User Manual:

Open the PDF directly: View PDF .
Page Count: 511

Download
Open PDF In Browser	View PDF

Apache Solr Reference Guide
Covering Apache Solr 4.10

Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.

Apache and the Apache feather logo are trademarks of The Apache Software Foundation. Apache Lucene, Apache
Solr and their respective logos are trademarks of the Apache Software Foundation. Please see the Apache
Trademark Policy for more information.

Apache Solr Reference Guide
This reference guide describes Apache Solr, the open source solution for search. You can download Apache Solr
from the Solr website at http://lucene.apache.org/solr/.
This Guide contains the following sections:
Getting Started: This section guides you through the
installation and setup of Solr.

Searching: This section presents an overview of the
search process in Solr. It describes the main components
used in searches, including request handlers, query
parsers, and response writers. It lists the query parameters
that can be passed to Solr, and it describes features such
as boosting and faceting, which can be used to fine-tune
search results.

Using the Solr Administration User Interface: This
section introduces the Solr Web-based user interface.
From your browser you can view configuration files,
submit queries, view logfile settings and Java
environment settings, and monitor and control distributed
configurations.
The Well-Configured Solr Instance: This section
discusses performance tuning for Solr. It begins with an
Documents, Fields, and Schema Design: This section
overview of the solrconfig.xml file, then tells you how
describes how Solr organizes its data for indexing. It
to configure cores with solr.xml, how to configure the
explains how a Solr schema defines the fields and field
Lucene index writer, and more.
types which Solr uses to organize data within the
document files it indexes.
Managing Solr: This section discusses important topics for
running and monitoring Solr. It describes running Solr in
Understanding Analyzers, Tokenizers, and Filters:
the Apache Tomcat servlet runner and Web server. Other
This section explains how Solr prepares text for indexing
topics include how to back up a Solr instance, and how to
and searching. Analyzers parse text and produce a
run Solr with Java Management Extensions (JMX).
stream of tokens, lexical units used for indexing and
searching. Tokenizers break field data down into tokens. SolrCloud: This section describes the newest and most
Filters perform other transformational or selective work exciting of Solr's new features, SolrCloud, which provides
on token streams.
comprehensive distributed capabilities.
Indexing and Basic Data Operations: This section
describes the indexing process and basic index
operations, such as commit, optimize, and rollback.

Legacy Scaling and Distribution: This section tells you
how to grow a Solr distribution by dividing a large index
into sections called shards, which are then distributed
across multiple servers, or by replicating a single index
across multiple services.
Client APIs: This section tells you how to access Solr
through various client APIs, including JavaScript, JSON,
and Ruby.

Apache Solr Reference Guide 4.10

About This Guide
This guide describes all of the important features and functions of Apache Solr. It is free to download from http://luce
ne.apache.org/solr/.
Designed to provide high-level documentation, this guide is intended to be more encyclopedic and less of a
cookbook. It is structured to address a broad spectrum of needs, ranging from new developers getting started to
well-experienced developers extending their application or troubleshooting. It will be of use at any point in the
application life cycle, for whenever you need authoritative information about Solr.
The material as presented assumes that you are familiar with some basic search concepts and that you can read
XML. It does not assume that you are a Java programmer, although knowledge of Java is helpful when working
directly with Lucene or when developing custom extensions to a Lucene/Solr installation.

Special Inline Notes
Special notes are included throughout these pages.
Note Type
Information

Notes

Tip

Warning

Look & Description
Notes with a blue background are used for information that is important for you to know.

Yellow notes are further clarifications of important points to keep in mind while using Solr.

Notes with a green background are Helpful Tips.

Notes with a red background are warning messages.

Hosts and Port Examples
The default port configured for Solr during the install process is 8983. The samples, URLs and screenshots in this
guide may show different ports, because the port number that Solr uses is configurable. If you have not customized
your installation of Solr, please make sure that you use port 8983 when following the examples, or configure your
own installation to use the port numbers shown in the examples. For information about configuring port numbers
used by Tomcat or Jetty, see Managing Solr.
Similarly, URL examples use 'localhost' throughout; if you are accessing Solr from a location remote to the server
hosting Solr, replace 'localhost' with the proper domain or IP where Solr is running.

Paths
Path information is given relative to solr.home, which is the location under the main Solr installation where Solr's
collections and their conf and data directories are stored. In the default Solr package, solr.home is example/s
olr, which is itself relative to where you unpackaged the application; if you have moved this location for your servlet
container or for another reason, the path to solr.home may be different than the default.

Apache Solr Reference Guide 4.10

Getting Started
Solr makes it easy for programmers to develop sophisticated, high-performance search applications with advanced
features such as faceting (arranging search results in columns with numerical counts of key terms). Solr builds on
another open source search technology: Lucene, a Java library that provides indexing and search technology, as
well as spellchecking, hit highlighting and advanced analysis/tokenization capabilities. Both Solr and Lucene are
managed by the Apache Software Foundation (www.apache.org).
The Lucene search library currently ranks among the top 15 open source projects and is one of the top 5 Apache
projects, with installations at over 4,000 companies. Lucene/Solr downloads have grown nearly ten times over the
past three years, with a current run-rate of over 6,000 downloads a day. The Solr search server, which provides
application builders a ready-to-use search platform on top of the Lucene search library, is the fastest growing
Lucene sub-project. Apache Lucene/Solr offers an attractive alternative to the proprietary licensed search and
discovery software vendors.
This section helps you get Solr up and running quickly, and introduces you to the basic Solr architecture and
features. It covers the following topics:
Installing Solr: A walkthrough of the Solr installation process.
Running Solr: An introduction to running Solr. Includes information on starting up the servers, adding documents,
and running queries.
A Quick Overview: A high-level overview of how Solr works.
A Step Closer: An introduction to Solr's home directory and configuration options.

Installing Solr
This section describes how to install Solr. You can install Solr anywhere that a suitable Java Runtime Environment
(JRE) is available, as detailed below. Currently this includes Linux, OS X, and Microsoft Windows. The instructions
in this section should work for any platform, with a few exceptions for Windows as noted.

Got Java?
You will need the Java Runtime Environment (JRE) version 1.7 or higher. At a command line, check your Java
version like this:
$ java -version
java version "1.7.0_55"
Java(TM) SE Runtime Environment (build 1.7.0_55-b13)
Java HotSpot(TM) 64-Bit Server VM (build 24.55-b03, mixed mode)

The output will vary, but you need to make sure you have version 1.7 or higher. If you don't have the required
version, or if the java command is not found, download and install the latest version from Oracle at http://www.oracle
.com/technetwork/java/javase/downloads/index.html.

Installing Solr
Solr is available from the Solr website at http://lucene.apache.org/solr/.
For Linux/Unix/OSX systems, download the .tgz file. For Microsoft Windows systems, download the .zip file.
Solr runs inside a Java servlet container such as Tomcat, Jetty, or Resin. The Solr distribution includes a working
demonstration server in the Example directory that runs in Jetty. You can use the example server as a template for

Apache Solr Reference Guide 4.10

your own installation, whether or not you are using Jetty as your servlet container. For more information about the
demonstration server, see the Solr Tutorial.
Solr ships with a working Jetty server, with optimized settings for Solr, inside the example directory. It is
recommended that you use the provided Jetty server for optimal performance. If you absolutely must use a
different servlet container then continue to the next section on how to install Solr.
To install Solr

1. Unpack the Solr distribution to your desired location.
2. Stop your Java servlet container.
3. Copy the solr.war file from the Solr distribution to the webapps directory of your servlet container. Do not
change the name of this file: it must be named solr.war.
4. Copy the Solr Home directory solr-4.x.0/example/solr/ from the distribution to your desired Solr
Home location.
5. Start your servlet container, passing to it the location of your Solr Home in one of these ways:
Set the Java system property solr.solr.home to your Solr Home. (for example, using the example
jetty setup: java -Dsolr.solr.home=/some/dir -jar start.jar).
Configure the servlet container so that a JNDI lookup of java:comp/env/solr/home by the Solr
webapp will point to your Solr Home.
Start the servlet container in the directory containing ./solr: the default Solr Home is solr under the
JVM's current working directory ($CWD/solr).
To confirm your installation, go to the Solr Admin page at http://localhost:8983/solr/ . Note that your
servlet container may have started on a different port: check the documentation for your servlet container to
troubleshoot that issue. Also note that if that port is already in use, Solr will not start. In that case, shut down the
servlet container running on that port, or change your Solr port.
For more information about installing and running Solr on different Java servlet containers, see the SolrInstall page
on the Solr Wiki.

Apache Solr Ref Guide 4.10

apache-solr-ref-guide-4.10

Navigation menu

Versions of this User Manual:

Views

Navigation