Changeset 1872
- Timestamp:
- 11/28/06 01:44:28
- Files:
-
- trunk (modified) (previous)
- trunk/plagger/lib/Plagger/Plugin/Summary/Simple.pm (modified) (1 diff)
- trunk/plagger/t/plugins/Summary-Simple/base.t (modified) (1 diff)
Legend:
- Unmodified
- Added
- Removed
- Modified
- Copied
- Moved
trunk/plagger/lib/Plagger/Plugin/Summary/Simple.pm
r1779 r1872 21 21 local $HTML::Tagset::isBodyElement{div} = 0; 22 22 my $html = $text->data; 23 while ($html =~ s|^\s*<( \w*)\s*[^>]*>(.*?)</\1>|$2|gs) {23 while ($html =~ s|^\s*<([^ >]+)(?:\s+[^>]+)?>(.*?)</\1>|$2|gs) { 24 24 if ($HTML::Tagset::isBodyElement{lc($1)}) { 25 25 return "<$1>$2</$1>"; trunk/plagger/t/plugins/Summary-Simple/base.t
r1778 r1872 82 82 <p>First paragraph</p> 83 83 84 === Make sure element names are extracted properly 85 --- input 86 <img src="..."> <i><a href="...">more text</a></i> some more text 87 --- expected 88 <img src="..."> <i><a href="...">more text</a></i> some more text 89 84 90 === I18N. Japanese plaintext 85 91 --- input
