[ Index ]
 

Code source de Typo3 4.1.3

Accédez au Source d'autres logiciels libres

Classes | Fonctions | Variables | Constantes | Tables

title

Body

[fermer]

/typo3/sysext/indexed_search/ -> class.crawler.php (sommaire)

Crawler hook for indexed search. Works with the "crawler" extension

Author: Kasper Skårhøj <kasperYYYY@typo3.com>
Poids: 969 lignes (33 kb)
Inclus ou requis:0 fois
Référencé: 0 fois
Nécessite: 0 fichiers

Définit 2 classes

tx_indexedsearch_crawler:: (18 méthodes):
  crawler_init()
  crawler_execute()
  crawler_execute_type1()
  crawler_execute_type2()
  crawler_execute_type3()
  crawler_execute_type4()
  cleanUpOldRunningConfigurations()
  checkUrl()
  indexExtUrl()
  indexSingleRecord()
  loadIndexerClass()
  getUidRootLineForClosestTemplate()
  generateNextIndexingTime()
  checkDeniedSuburls()
  addQueueEntryForHook()
  deleteFromIndex()
  processCmdmap_preProcess()
  processDatamap_afterDatabaseOperations()

tx_indexedsearch_files:: (2 méthodes):
  crawler_execute()
  loadIndexerClass()


Classe: tx_indexedsearch_crawler  - X-Ref

Crawler hook for indexed search. Works with the "crawler" extension

crawler_init(&$pObj)   X-Ref
Initialization of crawler hook.
This function is asked for each instance of the crawler and we must check if something is timed to happen and if so put entry(s) in the crawlers log to start processing.
In reality we select indexing configurations and evaluate if any of them needs to run.

param: object        Parent object (tx_crawler lib)
return: void

crawler_execute($params,&$pObj)   X-Ref
Call back function for execution of a log element

param: array        Params from log element. Must contain $params['indexConfigUid']
param: object        Parent object (tx_crawler lib)
return: array        Result array

crawler_execute_type1($cfgRec,&$session_data,$params,&$pObj)   X-Ref
Indexing records from a table

param: array        Indexing Configuration Record
param: array        Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call!
param: array        Parameters from the log queue.
param: object        Parent object (from "crawler" extension!)
return: void

crawler_execute_type2($cfgRec,&$session_data,$params,&$pObj)   X-Ref
Indexing files from fileadmin

param: array        Indexing Configuration Record
param: array        Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call!
param: array        Parameters from the log queue.
param: object        Parent object (from "crawler" extension!)
return: void

crawler_execute_type3($cfgRec,&$session_data,$params,&$pObj)   X-Ref
Indexing External URLs

param: array        Indexing Configuration Record
param: array        Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call!
param: array        Parameters from the log queue.
param: object        Parent object (from "crawler" extension!)
return: void

crawler_execute_type4($cfgRec,&$session_data,$params,&$pObj)   X-Ref
Page tree indexing type

param: array        Indexing Configuration Record
param: array        Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call!
param: array        Parameters from the log queue.
param: object        Parent object (from "crawler" extension!)
return: void

cleanUpOldRunningConfigurations()   X-Ref
Look up all old index configurations which are finished and needs to be reset and done

return: void

checkUrl($url,$urlLog,$baseUrl)   X-Ref
Check if an input URL are allowed to be indexed. Depends on whether it is already present in the url log.

param: string        URL string to check
param: array        Array of already indexed URLs (input url is looked up here and must not exist already)
param: string        Base URL of the indexing process (input URL must be "inside" the base URL!)
return: string        Returls the URL if OK, otherwise false

indexExtUrl($url, $pageId, $rl, $cfgUid, $setId)   X-Ref
Indexing External URL

param: string        URL, http://....
param: integer        Page id to relate indexing to.
param: array        Rootline array to relate indexing to
param: integer        Configuration UID
param: integer        Set ID value
return: array        URLs found on this page

indexSingleRecord($r,$cfgRec,$rl=NULL)   X-Ref
Indexing Single Record

param: array        Record to index
param: array        Configuration Record
param: array        Rootline array to relate indexing to
return: void

loadIndexerClass()   X-Ref
Include indexer class.

return: void

getUidRootLineForClosestTemplate($id)   X-Ref
Get rootline for closest TypoScript template root.
Algorithm same as used in Web > Template, Object browser

param: integer        The page id to traverse rootline back from
return: array        Array where the root lines uid values are found.

generateNextIndexingTime($cfgRec)   X-Ref
Generate the unix time stamp for next visit.

param: array        Index configuration record
return: integer        The next time stamp

checkDeniedSuburls($url, $url_deny)   X-Ref
Checks if $url has any of the URls in the $url_deny "list" in it and if so, returns true.

param: string        URL to test
param: string        String where URLs are separated by line-breaks; If any of these strings is the first part of $url, the function returns TRUE (to indicate denial of decend)
return: boolean        TRUE if there is a matching URL (hence, do not index!)

addQueueEntryForHook($cfgRec, $title)   X-Ref
Adding entry in queue for Hook

param: array        Configuration record
param: string        Title/URL
return: void

deleteFromIndex($id)   X-Ref
Deletes all data stored by indexed search for a given page

param: integer        Uid of the page to delete all pHash
return: void

processCmdmap_preProcess($command, $table, $id, $value, &$pObj)   X-Ref
TCEmain hook function for on-the-fly indexing of database records

param: string        TCEmain command
param: string        Table name
param: string        Record ID. If new record its a string pointing to index inside t3lib_tcemain::substNEWwithIDs
param: mixed        Target value (ignored)
param: object        Reference to tcemain calling object
return: void

processDatamap_afterDatabaseOperations($status, $table, $id, $fieldArray, &$pObj)   X-Ref
TCEmain hook function for on-the-fly indexing of database records

param: string        Status "new" or "update"
param: string        Table name
param: string        Record ID. If new record its a string pointing to index inside t3lib_tcemain::substNEWwithIDs
param: array        Field array of updated fields in the operation
param: object        Reference to tcemain calling object
return: void

Classe: tx_indexedsearch_files  - X-Ref

Crawler hook for indexed search. Works with the "crawler" extension
This hook is specifically used to index external files found on pages through the crawler extension.

crawler_execute($params,&$pObj)   X-Ref
Call back function for execution of a log element

param: array        Params from log element.
param: object        Parent object (tx_crawler lib)
return: array        Result array

loadIndexerClass()   X-Ref
Include indexer class.

return: void



Généré le : Sun Nov 25 17:13:16 2007 par Balluche grâce à PHPXref 0.7
  Clicky Web Analytics