ColdFusion PDF file search using cfsearch and SOLR extremely slow
I have a functional, working Adobe ColdFusion application that indexes about 2,000 PDF files via Solr search. It provides the expected results, but each search query against the collection takes 25-30 seconds.
This is how I indexed the 2,000 PDF files with Solr:
<!--- query the database for files --->
<cfset getfiles = application.file.getfiles()>

<!--- create the solr query set --->
<cfset filesquery = querynew("fileuid, filepath, title, description, fileext, added")>

<!--- create a new file query with the key path and download url --->
<cfoutput query="getfiles">
    <cfset ext = trim(getfiles.fileext)>
    <cfset path = expandpath('/docs/#fileuid#.#ext#')>
    <cfscript>
        newrow = queryaddrow(filesquery);
        querysetcell(filesquery, "fileuid", "#fileuid#");
        querysetcell(filesquery, "filepath", "#path#");
        querysetcell(filesquery, "title", "#filename#");
        querysetcell(filesquery, "description", "#description#");
        querysetcell(filesquery, "added", "#added#");
    </cfscript>
</cfoutput>

<!--- index the bunch --->
<cfindex
    query = "filesquery"
    collection = "resumes"
    action = "update"
    type = "file"
    key = "filepath"
    title = "title"
    body = "title, description"
    custom1 = "fileext"
    custom2 = "added"
    category = "file"
    status = "filestatus">
This is how the files are being searched, and where the 25-30 second Solr search happens:
<!--- imagine a form that posts the search terms (form.search) --->
<cfsearch
    name = "results"
    collection = "resumes"
    criteria = "#form.search#"
    contextpassages = "1"
    contextbytes = "300"
    maxrows = "100"
    contexthighlightbegin = "<strong>"
    contexthighlightend = " </strong>">

<!--- show the (results) query --->
Some additional info on the project: most of the files are less than 1 page in length, so there is no character cutoff when creating the index results with Solr. I have played with the Solr buffer limit within the ColdFusion Administrator, with no major discernible change in the time (it is currently at 40). I am on a development VM running MS Server 2003 on a 1.86 Xeon, with Adobe ColdFusion 9.0.1 and 1 GB of RAM. The JVM is Sun Microsystems (14.3-b01). Nothing else is running server-side, so performance should be unaffected by external factors.
It is providing the expected, perfect results, just not in a timely fashion.
You could try using CFSolrLib. It uses the Solr API directly. It is possible you may get a performance boost by bypassing <cfsearch>.
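A rough way to check whether bypassing <cfsearch> helps, before installing anything, might be to query Solr's HTTP select handler directly with <cfhttp> and time the round trip. The sketch below is only an illustration under assumptions that are not stated in the question: that the Solr instance bundled with ColdFusion listens on localhost port 8983, and that the collection is exposed as a core named "resumes". The variable names (solrUrl, solrResponse, elapsedMs) are made up for the example.

```cfml
<!--- ASSUMPTION: Solr is reachable on localhost:8983 and the "resumes"
      collection is exposed as a core of the same name; adjust as needed --->
<cfset solrUrl = "http://localhost:8983/solr/resumes/select">

<!--- time the raw Solr query so it can be compared with the <cfsearch> timing --->
<cfset startTick = getTickCount()>
<cfhttp url="#solrUrl#" method="get" result="solrResponse" timeout="60">
    <!--- same search terms and row limit as the <cfsearch> call --->
    <cfhttpparam type="url" name="q" value="#form.search#">
    <cfhttpparam type="url" name="rows" value="100">
    <!--- ask Solr for a JSON response so it can be parsed with deserializeJSON --->
    <cfhttpparam type="url" name="wt" value="json">
</cfhttp>
<cfset elapsedMs = getTickCount() - startTick>

<!--- parse the response and report both the round-trip time and Solr's own QTime --->
<cfset solrData = deserializeJSON(solrResponse.fileContent)>
<cfoutput>
    Solr returned #solrData.response.numFound# matches in #elapsedMs# ms
    (Solr QTime: #solrData.responseHeader.QTime# ms)
</cfoutput>
```

If this comes back quickly while <cfsearch> still takes 25-30 seconds, the time is probably being spent in the ColdFusion layer rather than in Solr itself. Note that this query does not request context passages or highlighting, so it is not a like-for-like replacement for the <cfsearch> call above.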