Saved in:
Main Authors: | , |
---|---|
Format: | Book |
Language: | English |
Published: |
Birmingham [u.a.]
Packt Publ. [u.a.]
2011
|
Edition: | 2. publ. |
Series: | Community experience distilled
|
Subjects: | |
Links: | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024632134&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
Physical Description: | IX, 393 S. graph. Darst. |
ISBN: | 9781849516068 |
Staff View
MARC
LEADER | 00000nam a2200000 c 4500 | ||
---|---|---|---|
001 | BV039771143 | ||
003 | DE-604 | ||
005 | 20130225 | ||
007 | t| | ||
008 | 111220s2011 xx d||| |||| 00||| eng d | ||
020 | |a 9781849516068 |9 978-1-84951-606-8 | ||
035 | |a (OCoLC)773975642 | ||
035 | |a (DE-599)HBZHT017070106 | ||
040 | |a DE-604 |b ger |e rakwb | ||
041 | 0 | |a eng | |
049 | |a DE-523 |a DE-473 | ||
084 | |a ST 252 |0 (DE-625)143627: |2 rvk | ||
084 | |a ST 253 |0 (DE-625)143628: |2 rvk | ||
100 | 1 | |a Smiley, David |e Verfasser |0 (DE-588)139477330 |4 aut | |
245 | 1 | 0 | |a Apache Solr 3 Enterprise Search Server |b enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more |c David Smiley ; Eric Pugh |
250 | |a 2. publ. | ||
264 | 1 | |a Birmingham [u.a.] |b Packt Publ. [u.a.] |c 2011 | |
300 | |a IX, 393 S. |b graph. Darst. | ||
336 | |b txt |2 rdacontent | ||
337 | |b n |2 rdamedia | ||
338 | |b nc |2 rdacarrier | ||
490 | 0 | |a Community experience distilled | |
650 | 0 | 7 | |a Lucene |0 (DE-588)4800725-0 |2 gnd |9 rswk-swf |
689 | 0 | 0 | |a Lucene |0 (DE-588)4800725-0 |D s |
689 | 0 | |5 DE-604 | |
700 | 1 | |a Pugh, Eric |e Verfasser |4 aut | |
856 | 4 | 2 | |m Digitalisierung UB Bamberg |q application/pdf |u http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024632134&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |3 Inhaltsverzeichnis |
943 | 1 | |a oai:aleph.bib-bvb.de:BVB01-024632134 |
Record in the Search Index
_version_ | 1819264730083098624 |
---|---|
adam_text | Table
of
Contents
Preface
Chapter
1:
Quick Starting Solr
An introduction to Solr
7
Lucene, the underlying engine
8
Solr, a Lucene-based search server
9
Comparison to database technology
10
Getting started
11
Solr s installation directory structure
12
Solr s home directory and Solr cores
14
Running Solr
15
A quick tour of Solr
16
Loading sample data
18
A simple query
20
Some statistics
23
The sample browse interface
24
Configuration files
25
Resources outside this book
27
Summary
28
Chapter
2:
Schema and Text Analysis
___________________________29
MusicBrainz.org
30
One combined index or separate indices
31
One combined index
32
Problems with using a single combined index
33
Separate indices
34
Schema design
35
Step
1 :
Determine which searches are going to be powered by Solr
36
Step
2:
Determine the entities returned from each search
36
Step
3:
Denormalize related data
37
Table
of
Contents
Denormalizing—
one-to-one associated data
37
Denormalizing
—
one-to-many associated data
38
Step
4:
(Optional) Omit the inclusion of fields only used in search results
39
The schema.xml file
40
Defining field types
41
Built-in field type classes
42
Numbers and dates
42
Geospatial
43
Field options
43
Field definitions
44
Dynamic field definitions
45
Our MusicBrainz field definitions
46
Copying fields
48
The unique key
49
The default search field and query operator
49
Text analysis
50
Configuration
51
Experimenting with text analysis
54
Character filters
55
Tokenization
57
WordDelimiterFilter
59
Stemming
61
Correcting and augmenting stemming
62
Synonyms
63
Index-time versus query-time, and to expand or not
64
Stop words
65
Phonetic sounds-like analysis
66
Substring indexing and wildcards
67
ReversedWildcardFilter
68
N-grams
69
N-gram costs
70
Sorting Text
71
Miscellaneous token filters
72
Summary
73
Chapter
3:
Indexing Data
75
Communicating with Solr
76
Direct HTTP or a convenient client API
76
Push data to Solr or have Solr pull it
76
Data formats
76
HTTP POSTing options to Solr
77
Remote streaming
79
Solr s
Update-XML
format
--------------------------------------------------------—
Γ
iil
------____________________________
80
Table of
Contents
Deleting documents
81
Commit, optimize, and rollback
82
Sending CSV formatted data to Solr
84
Configuration options
86
The Data Import Handler Framework
87
Setup
88
The development console
89
Writing a DIH configuration file
90
Data Sources
90
Entity processors
91
Fields and transformers
92
Example DIH configurations
94
Importing from databases
94
Importing XML from a file with XSLT
96
Importing multiple rich document files (crawling)
97
Importing commands
98
Delta imports
99
Indexing documents with Solr Cell
100
Extracting text and metadata from files
100
Configuring Solr
101
Solr Cell parameters
102
Extracting karaoke lyrics
104
Indexing richer documents
106
Update request processors
109
Summary
110
Chapter
4:
Searching
_________________________________________
11[
Your first search, a walk-through
112
Solr s generic XML structured data representation
114
Solr s XML response format
115
Parsing the URL
116
Request handlers
117
Query parameters
119
Search criteria related parameters
119
Result pagination related parameters
120
Output related parameters
121
Diagnostic related parameters
121
Query parsers and local-params
122
Query syntax (the lucene query parser)
123
Matching all the documents
125
Mandatory, prohibited, and optional clauses
125
Boolean operators
126
Sub-queries
127
-------------------------------------------------------------------
[Ш]
--------------------------------------------------------------------
Table
of
Contents
Limitations of prohibited clauses in sub-queries
128
Field qualifier
128
Phrase queries and term proximity
129
Wildcard queries
129
Fuzzy queries
131
Range queries
131
Date math 1
32
Score boosting
133
Existence (and non-existence) queries
134
Escaping special characters
134
The Dismax query parser (part
1 ) 135
Searching multiple fields
137
Limited query syntax
137
Min-should-match
138
Basic rules
138
Multiple rules
139
What to choose
140
A default search
140
Filtering
141
Sorting
142
Geos
patia
I sea re
h
143
Indexing locations
143
Filtering by distance
144
Sorting by distance
145
Summary
146
Chapter
5:
Search Relevancy
____________________________147
Scoring
148
Query-time and index-time boosting
149
Troubleshooting queries and scoring
149
Dismax query parser (part
2) 151
Lucene s DisjunctionMaxQuery
152
Boosting: Automatic phrase boosting
153
Configuring automatic phrase boosting
153
Phrase slop configuration
154
Partial phrase boosting
154
Boosting: Boost queries
155
Boosting: Boost functions
156
Add or multiply boosts?
157
Function queries
158
Field references
159
Function reference
160
Mathematical primitives
161
Other math
161
----------------------------------------------------------
[¡v]
-----------------------------------------------------------
Table of
Contents
ord
and rord
162
Miscellaneous functions
162
Function query boosting
164
Formula: Logarithm
164
Formula: Inverse reciprocal
165
Formula: Reciprocal
167
Formula: Linear
168
How to boost based on an increasing numeric field
168
Step by step.
.. 169
External field values
170
How to boost based on recent dates
170
Step by step...
170
Summary
171
Chapter
6:
Faceting
__________________________________________173
A quick example: Faceting release types
174
MusicBrainz schema changes
176
Field requirements
178
Types of faceting
178
Faceting field values
179
Alphabetic range bucketing
181
Faceting numeric and date ranges
182
Range facet parameters
185
Facet queries
187
Building a filter query from a facet
188
Field value filter queries
189
Facet range filter queries
189
Excluding filters (multi-select faceting)
190
Hierarchical faceting
194
Summary
196
Chapter
7:
Search Components
________________________________197
About components
198
The Highlight component
200
A highlighting example
200
Highlighting configuration
202
The regex
fragmenter
205
The fast vector highlighter with multi-colored highlighting
205
The SpellCheck component
207
Schema configuration
208
Configuration in solrconfig.xml
209
Configuring spellcheckers (dictionaries)
211
Processing of the
q
parameter
213
Processing of the spellcheck.q parameter
213
Building the dictionary from its source
214
------------------------------------
[v]
------------------------------------
Table
of
Contents
sunspot_rails gem
314
Setting up MyFaves project
315
Populating MyFaves relational database from Solr 316
Build Solr indexes from a relational database
318
Complete MyFaves website 320
Which Rails/Ruby library should I use?
322
Nutch for crawling web pages
323
Maintaining document security with ManifoldCF
324
Connectors
325
Putting ManifoldCF to use
325
Summary
328
Chapter
10:
Scaling Solr
______________________________________329
Tuning complex systems
330
Testing Solr performance with SolrMeter
332
Optimizing a single Solr server (Scale up)
334
Configuring JVM settings to improve memory usage
334
MMapDirectoryFactory to leverage additional virtual memory
335
Enabling downstream HTTP caching
335
Solr caching
338
Tuning caches
339
Indexing performance
340
Designing the schema
340
Sending data to Solr in bulk
341
Don t overlap commits
342
Disabling unique key checking
343
Index optimization factors
343
Enhancing faceting performance
345
Using term vectors
345
Improving phrase search performance
346
Moving to multiple Solr servers (Scale horizontally)
348
Replication
349
Starting multiple Solr servers
349
Configuring replication
351
Load balancing searches across slaves
352
Indexing into the master server
352
Configuring slaves
353
Configuring load balancing
354
Sharding indexes
356
Assigning documents to shards
357
Searching across shards (distributed search)
358
Combining replication and sharding (Scale deep)
360
Near real time search
362
Where next for scaling Solr?
363
Summary
364
------------------------------------------[viii]-------------------------------------------
Table of Contents
Appendix: Search Quick Reference
____________________________365
Quick reference
366
Index
369
|
any_adam_object | 1 |
author | Smiley, David Pugh, Eric |
author_GND | (DE-588)139477330 |
author_facet | Smiley, David Pugh, Eric |
author_role | aut aut |
author_sort | Smiley, David |
author_variant | d s ds e p ep |
building | Verbundindex |
bvnumber | BV039771143 |
classification_rvk | ST 252 ST 253 |
ctrlnum | (OCoLC)773975642 (DE-599)HBZHT017070106 |
discipline | Informatik |
edition | 2. publ. |
format | Book |
fullrecord | <?xml version="1.0" encoding="UTF-8"?><collection xmlns="http://www.loc.gov/MARC21/slim"><record><leader>01450nam a2200361 c 4500</leader><controlfield tag="001">BV039771143</controlfield><controlfield tag="003">DE-604</controlfield><controlfield tag="005">20130225 </controlfield><controlfield tag="007">t|</controlfield><controlfield tag="008">111220s2011 xx d||| |||| 00||| eng d</controlfield><datafield tag="020" ind1=" " ind2=" "><subfield code="a">9781849516068</subfield><subfield code="9">978-1-84951-606-8</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(OCoLC)773975642</subfield></datafield><datafield tag="035" ind1=" " ind2=" "><subfield code="a">(DE-599)HBZHT017070106</subfield></datafield><datafield tag="040" ind1=" " ind2=" "><subfield code="a">DE-604</subfield><subfield code="b">ger</subfield><subfield code="e">rakwb</subfield></datafield><datafield tag="041" ind1="0" ind2=" "><subfield code="a">eng</subfield></datafield><datafield tag="049" ind1=" " ind2=" "><subfield code="a">DE-523</subfield><subfield code="a">DE-473</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 252</subfield><subfield code="0">(DE-625)143627:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="084" ind1=" " ind2=" "><subfield code="a">ST 253</subfield><subfield code="0">(DE-625)143628:</subfield><subfield code="2">rvk</subfield></datafield><datafield tag="100" ind1="1" ind2=" "><subfield code="a">Smiley, David</subfield><subfield code="e">Verfasser</subfield><subfield code="0">(DE-588)139477330</subfield><subfield code="4">aut</subfield></datafield><datafield tag="245" ind1="1" ind2="0"><subfield code="a">Apache Solr 3 Enterprise Search Server</subfield><subfield code="b">enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more</subfield><subfield code="c">David Smiley ; Eric Pugh</subfield></datafield><datafield tag="250" ind1=" " ind2=" "><subfield code="a">2. publ.</subfield></datafield><datafield tag="264" ind1=" " ind2="1"><subfield code="a">Birmingham [u.a.]</subfield><subfield code="b">Packt Publ. [u.a.]</subfield><subfield code="c">2011</subfield></datafield><datafield tag="300" ind1=" " ind2=" "><subfield code="a">IX, 393 S.</subfield><subfield code="b">graph. Darst.</subfield></datafield><datafield tag="336" ind1=" " ind2=" "><subfield code="b">txt</subfield><subfield code="2">rdacontent</subfield></datafield><datafield tag="337" ind1=" " ind2=" "><subfield code="b">n</subfield><subfield code="2">rdamedia</subfield></datafield><datafield tag="338" ind1=" " ind2=" "><subfield code="b">nc</subfield><subfield code="2">rdacarrier</subfield></datafield><datafield tag="490" ind1="0" ind2=" "><subfield code="a">Community experience distilled</subfield></datafield><datafield tag="650" ind1="0" ind2="7"><subfield code="a">Lucene</subfield><subfield code="0">(DE-588)4800725-0</subfield><subfield code="2">gnd</subfield><subfield code="9">rswk-swf</subfield></datafield><datafield tag="689" ind1="0" ind2="0"><subfield code="a">Lucene</subfield><subfield code="0">(DE-588)4800725-0</subfield><subfield code="D">s</subfield></datafield><datafield tag="689" ind1="0" ind2=" "><subfield code="5">DE-604</subfield></datafield><datafield tag="700" ind1="1" ind2=" "><subfield code="a">Pugh, Eric</subfield><subfield code="e">Verfasser</subfield><subfield code="4">aut</subfield></datafield><datafield tag="856" ind1="4" ind2="2"><subfield code="m">Digitalisierung UB Bamberg</subfield><subfield code="q">application/pdf</subfield><subfield code="u">http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024632134&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA</subfield><subfield code="3">Inhaltsverzeichnis</subfield></datafield><datafield tag="943" ind1="1" ind2=" "><subfield code="a">oai:aleph.bib-bvb.de:BVB01-024632134</subfield></datafield></record></collection> |
id | DE-604.BV039771143 |
illustrated | Illustrated |
indexdate | 2024-12-20T16:02:12Z |
institution | BVB |
isbn | 9781849516068 |
language | English |
oai_aleph_id | oai:aleph.bib-bvb.de:BVB01-024632134 |
oclc_num | 773975642 |
open_access_boolean | |
owner | DE-523 DE-473 DE-BY-UBG |
owner_facet | DE-523 DE-473 DE-BY-UBG |
physical | IX, 393 S. graph. Darst. |
publishDate | 2011 |
publishDateSearch | 2011 |
publishDateSort | 2011 |
publisher | Packt Publ. [u.a.] |
record_format | marc |
series2 | Community experience distilled |
spellingShingle | Smiley, David Pugh, Eric Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more Lucene (DE-588)4800725-0 gnd |
subject_GND | (DE-588)4800725-0 |
title | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more |
title_auth | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more |
title_exact_search | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more |
title_full | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more David Smiley ; Eric Pugh |
title_fullStr | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more David Smiley ; Eric Pugh |
title_full_unstemmed | Apache Solr 3 Enterprise Search Server enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more David Smiley ; Eric Pugh |
title_short | Apache Solr 3 Enterprise Search Server |
title_sort | apache solr 3 enterprise search server enhance your search with faceted navigation result highlighting relevancy renked sorting and more |
title_sub | enhance your search with faceted navigation, result highlighting, relevancy renked sorting, and more |
topic | Lucene (DE-588)4800725-0 gnd |
topic_facet | Lucene |
url | http://bvbr.bib-bvb.de:8991/F?func=service&doc_library=BVB01&local_base=BVB01&doc_number=024632134&sequence=000002&line_number=0001&func_code=DB_RECORDS&service_type=MEDIA |
work_keys_str_mv | AT smileydavid apachesolr3enterprisesearchserverenhanceyoursearchwithfacetednavigationresulthighlightingrelevancyrenkedsortingandmore AT pugheric apachesolr3enterprisesearchserverenhanceyoursearchwithfacetednavigationresulthighlightingrelevancyrenkedsortingandmore |