Changeset 839
- Timestamp:
- 05/29/06 17:49:30
- Files:
Legend:
- Unmodified
- Added
- Removed
- Modified
- Copied
- Moved
trunk/plagger/assets/plugins/Filter-EntryFullText/itmedia.yaml
r805 r839 1 1 author: manabou 2 2 handle: http://(\w+)\.itmedia\.co\.jp/(\w+)/articles 3 extract: <div class="newart">.*?<h1>(.*?)</h1>.*?<!--BODY-->(.*?)<!--BODYEND--> 4 extract_capture: title body 3 extract: <div class="newart">.*?(?:<div id="update">(\d{4}年\d\d月\d\d日 \d\d時\d\d分) 更新</div>)?.*?<h1>(.*?)</h1>\s*<h5>(.*?)</h5>\s*(?:<div id="update">(\d{4}年\d\d月\d\d日 \d\d時\d\d分) 更新</div>)?.*?<!--BODY-->(.*?)<!--BODYEND--> 4 extract_capture: date1 title summary date2 body 5 extract_after_hook: $data->{date} = $data->{date1} || $data->{date2} 6 extract_date_format: %Y年%m月%d日 %H時%M分
