php class ตัวนี้ เหมาะสำหรับใช้งาน ในการค้นหา และดึงข้อมูล จาก แท็ก HTML เฉพาะส่วนที่ต้องการ เช่น
1
2
3
4
5
6
7
8
9
10
11
12
13
14
$ret
=
$html
->find(
'a'
);
$ret
=
$html
->find(
'a'
, 0);
$ret
=
$html
->find(
'div[id=foo]'
);
$ret
=
$html
->find(
'div[id]'
);
$ret
=
$html
->find(
'[id]'
);
อ่านวิธีการใช้งานแบบละเอียดเพิ่มเติมได้ด้วยตัวเองได้ที่
http://simplehtmldom.sourceforge.net/manual.htm
ดาวน์โหลดไฟล์ php class สำหรับใช้งานได้ที่
http://sourceforge.net/projects/simplehtmldom/files/
แนะนำให้โหลดเก็บไว้ใช้งาน รวมทั้งบันทึกวิธีการใช้งานไว้ด้วย
คู่มือ
Index
Top
How to create HTML DOM object?
Top
$html = str_get_html ('<html><body>Hello!</body></html>' );
$html = file_get_html ('http://www.google.com/' );
$html = file_get_html ('test.htm' );
$html = new simple_html_dom ();
$html->load ('<html><body>Hello!</body></html>' );
$html->load_file ('http://www.google.com/' );
$html->load_file ('test.htm' );
How to find HTML elements?
Top
$ret = $html->find('a ' );
$ret = $html->find('a ', 0 );
$ret = $html->find('div[id=foo] ' );
$ret = $html->find('div[id] ' );
$ret = $html->find('[id] ' );
$ret = $html->find('#foo ' );
$ret = $html->find('.foo ' );
$ret = $html->find('a, img ' );
$ret = $html->find('a[title], img[title] ' );
Supports these operators in attribute selectors:
Filter
Description
[attribute]
Matches elements that have the specified attribute.
[attribute=value]
Matches elements that have the specified attribute with a certain value .
[attribute!=value]
Matches elements that don't have the specified attribute with a certain value.
[attribute^=value]
Matches elements that have the specified attribute and it starts with a certain value.
[attribute$=value]
Matches elements that have the specified attribute and it ends with a certain value.
[attribute*=value]
Matches elements that have the specified attribute and it contains a certain value.
$es = $html->find('ul li ' );
$es = $html->find('div div div ' );
$es = $html->find('table.hello td ' );
$es = $html->find(''table td[align=center] ' );
foreach($html->find('ul ' ) as $ul)
{
foreach($ul->find('li ' ) as $li)
{
}
}
$e = $html->find('ul ', 0 )->find('li ', 0 );
How to access the HTML element's attributes?
Top
$value = $e->href ;
$e->href = 'my link' ;
$e->href = null ;
if(isset($e->href ))
echo 'href exist!' ;
$ html = str_get_html
( "<div>foo <b>bar</b></div>" ) ;
$e = $html->find(
"div" ,
0 );
echo $e->
tag ;
echo $e->
outertext ;
echo $e->
innertext ;
echo $e->
plaintext ;
Attribute Name
Usage
$e->tag
Read or write the tag name of element.
$e->outertext
Read or write the outer HTML text of element.
$e->innertext
Read or write the inner HTML text of element.
$e->plaintext
Read or write the plain text of element.
echo $html ->plaintext ;
$e->outertext = '<div class="wrap">' . $e->outertext . '<div> ';
$e->outertext = '' ;
$e->outertext = $e->outertext . '<div>foo <div> ';
$e->outertext = '<div>foo <div> ' . $e->outertext ;
How to traverse the DOM tree?
Top
echo $html->find ("#div1", 0 )->children (1 )->children (1 )->children (2 )->id ;
echo $html->getElementById ("div1" )->childNodes (1 )->childNodes (1 )->childNodes (2 )->getAttribute ('id' );
You can also call methods with
Camel naming convertions .
Method
Description
mixed
$e->children ( [int $index] )
Returns the Nth child object if index is set, otherwise return an array of children .
element
$e->parent ()
Returns the parent of element.
element
$e->first_child ()
Returns the first child of element, or null if not found.
element
$e->last_child ()
Returns the last child of element, or null if not found.
element
$e->next_sibling ()
Returns the next sibling of element, or null if not found.
element
$e->prev_sibling ()
Returns the previous sibling of element, or null if not found.
How to dump contents of DOM object?
Top
$str = $html->save ();
$html->save ('result.htm' );
$str = $html;
echo $html;
How to customize the parsing behavior?
Top
function my_callback($element ) {
if ($element->tag=='b' )
$element->outertext = '';
}
$html->set_callback ('my_callback' );
echo $html;
Author: S.C. Chen (me578022@gmail.com)
Original idea is from Jose Solorzano's
HTML Parser for PHP 4 .
Contributions by: Yousuke Kumakura (Attribute Filters)