Maarten Balliauw 2/1/2008

Indexing Word 2007 (docx) files with Zend_Search_Lucene

Read Original

This article provides a detailed, code-heavy guide on how to use the Zend Framework's PHP port of Apache Lucene to create a search index for Word 2007 (.docx) documents. It covers prerequisites, creating an index, extracting text and metadata from .docx files, and adding document fields for searchability, addressing a gap in available examples for this specific file format.

Indexing Word 2007 (docx) files with Zend_Search_Lucene

Comments

No comments yet

Be the first to share your thoughts!

Browser Extension

Get instant access to AllDevBlogs from your browser

Top of the Week

2
Introducing RSC Explorer
Dan Abramov 1 votes
4
Fragments Dec 11
Martin Fowler 1 votes
5
Adding Type Hints to my Blog
Daniel Feldroy 1 votes
6
Refactoring English: Month 12
Michael Lynch 1 votes
8
10
You Gotta Push If You Wanna Pull
Gunnar Morling 1 votes