Indexing configuration - optimizations for Magnolia 5

The indexing configuration shipped with Magnolia so far has never been tweaked/optimized (there is no indexing config file included at all and the searchIndex config in the main jackrabbit configuration file doesn't contain any interesting feature like spellchecker, analyzers, excerpts handling).

See http://wiki.apache.org/jackrabbit/IndexingConfiguration for details on jackrabbitindexing configuration

Following some samples that have been discussed during the unconference at #mconf12 with Jan. We should investigate on which features could be included by default in Magnolia 5.

A few utility classes and sample configurations are included in openmind criteria API

Search index configuration

</SearchIndex>

indexing_configuration.xml sample

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE configuration SYSTEM "http://jackrabbit.apache.org/dtd/indexing-configuration-1.2.dtd">
<configuration xmlns:nt="http://www.jcp.org/jcr/nt/1.0" xmlns:mgnl="http://www.magnolia.info/jcr/mgnl"
xmlns:jcr="http://www.jcp.org/jcr/1.0">


<analyzers>
<analyzer class="org.apache.lucene.analysis.KeywordAnalyzer">
<property>tags</property>
</analyzer>
</analyzers>

<index-rule nodeType="nt:hierarchyNode">
<property boost="10" useInExcerpt="false">title</property>
<property boost="1.0" useInExcerpt="true">text</property>

<property isRegexp="true" nodeScopeIndex="false" useInExcerpt="false">.*:.*</property>
</index-rule>
<index-rule nodeType="mgnl:contentNode">
<property boost="5" nodeScopeIndex="false" useInExcerpt="false">title</property>
<property boost="2" nodeScopeIndex="false" useInExcerpt="true">text</property>

<property isRegexp="true" nodeScopeIndex="false" useInExcerpt="false">.*:.*</property>
</index-rule>


<aggregate primaryType="mgnl:content">

<include primaryType="mgnl:contentNode">nomeoftheareanode/*</include>
</aggregate>


<aggregate primaryType="mgnl:content">
<include>mgnl:creationdate</include>
<include-property>MetaData/mgnl:creationdate</include-property>
</aggregate>
<aggregate primaryType="mgnl:content">
<include>mgnl:lastmodified</include>
<include-property>MetaData/mgnl:lastmodified</include-property>
</aggregate>
<aggregate primaryType="mgnl:content">
<include>mgnl:template</include>
<include-property>MetaData/mgnl:template</include-property>
</aggregate>

</configuration>

Page tree

Search index configuration

indexing_configuration.xml sample

1 Comment

Magnolia International