[ Index ] |
|
Code source de Typo3 4.1.3 |
[Code source] [Imprimer] [Statistiques]
Crawler hook for indexed search. Works with the "crawler" extension
Author: | Kasper Skårhøj <kasperYYYY@typo3.com> |
Poids: | 969 lignes (33 kb) |
Inclus ou requis: | 0 fois |
Référencé: | 0 fois |
Nécessite: | 0 fichiers |
tx_indexedsearch_crawler:: (18 méthodes):
crawler_init()
crawler_execute()
crawler_execute_type1()
crawler_execute_type2()
crawler_execute_type3()
crawler_execute_type4()
cleanUpOldRunningConfigurations()
checkUrl()
indexExtUrl()
indexSingleRecord()
loadIndexerClass()
getUidRootLineForClosestTemplate()
generateNextIndexingTime()
checkDeniedSuburls()
addQueueEntryForHook()
deleteFromIndex()
processCmdmap_preProcess()
processDatamap_afterDatabaseOperations()
tx_indexedsearch_files:: (2 méthodes):
crawler_execute()
loadIndexerClass()
Classe: tx_indexedsearch_crawler - X-Ref
Crawler hook for indexed search. Works with the "crawler" extensioncrawler_init(&$pObj) X-Ref |
Initialization of crawler hook. This function is asked for each instance of the crawler and we must check if something is timed to happen and if so put entry(s) in the crawlers log to start processing. In reality we select indexing configurations and evaluate if any of them needs to run. param: object Parent object (tx_crawler lib) return: void |
crawler_execute($params,&$pObj) X-Ref |
Call back function for execution of a log element param: array Params from log element. Must contain $params['indexConfigUid'] param: object Parent object (tx_crawler lib) return: array Result array |
crawler_execute_type1($cfgRec,&$session_data,$params,&$pObj) X-Ref |
Indexing records from a table param: array Indexing Configuration Record param: array Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call! param: array Parameters from the log queue. param: object Parent object (from "crawler" extension!) return: void |
crawler_execute_type2($cfgRec,&$session_data,$params,&$pObj) X-Ref |
Indexing files from fileadmin param: array Indexing Configuration Record param: array Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call! param: array Parameters from the log queue. param: object Parent object (from "crawler" extension!) return: void |
crawler_execute_type3($cfgRec,&$session_data,$params,&$pObj) X-Ref |
Indexing External URLs param: array Indexing Configuration Record param: array Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call! param: array Parameters from the log queue. param: object Parent object (from "crawler" extension!) return: void |
crawler_execute_type4($cfgRec,&$session_data,$params,&$pObj) X-Ref |
Page tree indexing type param: array Indexing Configuration Record param: array Session data for the indexing session spread over multiple instances of the script. Passed by reference so changes hereto will be saved for the next call! param: array Parameters from the log queue. param: object Parent object (from "crawler" extension!) return: void |
cleanUpOldRunningConfigurations() X-Ref |
Look up all old index configurations which are finished and needs to be reset and done return: void |
checkUrl($url,$urlLog,$baseUrl) X-Ref |
Check if an input URL are allowed to be indexed. Depends on whether it is already present in the url log. param: string URL string to check param: array Array of already indexed URLs (input url is looked up here and must not exist already) param: string Base URL of the indexing process (input URL must be "inside" the base URL!) return: string Returls the URL if OK, otherwise false |
indexExtUrl($url, $pageId, $rl, $cfgUid, $setId) X-Ref |
Indexing External URL param: string URL, http://.... param: integer Page id to relate indexing to. param: array Rootline array to relate indexing to param: integer Configuration UID param: integer Set ID value return: array URLs found on this page |
indexSingleRecord($r,$cfgRec,$rl=NULL) X-Ref |
Indexing Single Record param: array Record to index param: array Configuration Record param: array Rootline array to relate indexing to return: void |
loadIndexerClass() X-Ref |
Include indexer class. return: void |
getUidRootLineForClosestTemplate($id) X-Ref |
Get rootline for closest TypoScript template root. Algorithm same as used in Web > Template, Object browser param: integer The page id to traverse rootline back from return: array Array where the root lines uid values are found. |
generateNextIndexingTime($cfgRec) X-Ref |
Generate the unix time stamp for next visit. param: array Index configuration record return: integer The next time stamp |
checkDeniedSuburls($url, $url_deny) X-Ref |
Checks if $url has any of the URls in the $url_deny "list" in it and if so, returns true. param: string URL to test param: string String where URLs are separated by line-breaks; If any of these strings is the first part of $url, the function returns TRUE (to indicate denial of decend) return: boolean TRUE if there is a matching URL (hence, do not index!) |
addQueueEntryForHook($cfgRec, $title) X-Ref |
Adding entry in queue for Hook param: array Configuration record param: string Title/URL return: void |
deleteFromIndex($id) X-Ref |
Deletes all data stored by indexed search for a given page param: integer Uid of the page to delete all pHash return: void |
processCmdmap_preProcess($command, $table, $id, $value, &$pObj) X-Ref |
TCEmain hook function for on-the-fly indexing of database records param: string TCEmain command param: string Table name param: string Record ID. If new record its a string pointing to index inside t3lib_tcemain::substNEWwithIDs param: mixed Target value (ignored) param: object Reference to tcemain calling object return: void |
processDatamap_afterDatabaseOperations($status, $table, $id, $fieldArray, &$pObj) X-Ref |
TCEmain hook function for on-the-fly indexing of database records param: string Status "new" or "update" param: string Table name param: string Record ID. If new record its a string pointing to index inside t3lib_tcemain::substNEWwithIDs param: array Field array of updated fields in the operation param: object Reference to tcemain calling object return: void |
Classe: tx_indexedsearch_files - X-Ref
Crawler hook for indexed search. Works with the "crawler" extensioncrawler_execute($params,&$pObj) X-Ref |
Call back function for execution of a log element param: array Params from log element. param: object Parent object (tx_crawler lib) return: array Result array |
loadIndexerClass() X-Ref |
Include indexer class. return: void |
Généré le : Sun Nov 25 17:13:16 2007 | par Balluche grâce à PHPXref 0.7 |
![]() |