Below is a sample HTML document. It contains a script, and inside the script there is an array that I want to extract, preferably as a PHP array, although getting it as a string would also be fine. I don't understand how to target that part of the script. The part I want to extract is: (1234567, 111, 'Red Pants', 'https://example.com/red', '', 1, '','','0','0','0', 'buy').
It seems that the node value is the entire script, and I am unsure how to isolate the part I need.
PHP
<pre><?php
$content = file_get_contents("test.html");
$dom = new DOMDocument();
@$dom->loadHTML($content);
$dom->preserveWhiteSpace = false;
foreach($dom->getElementsByTagName('script') as $scripts) {
var_dump($scripts);
}
HTML
<!doctype html>
<html>
<head>
<title>Example Domain</title>
<meta charset="utf-8" />
<meta http-equiv="Content-type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
</head>
<script>
function init() {
YAHOO.util.Event.on(['adding_to_cart','adding_to_trolley_sub'], 'click', addProdToTrolley);
YAHOO.util.Event.on('add-prod-to-watchlist', 'click', addProdToWatchList);
YAHOO.util.Event.on(['buy_now','buy_now_sub'], 'click', function() { SHOP.Purchase.buy(1234567, 111, 'Red Pants', 'https://example.com/red', '', 1, '','','0','0','0', 'buy');});
YAHOO.util.Event.on(['frm_buttons','frm_buttons_sub'], 'submit', function(e) { YAHOO.util.Event.stopEvent(e); });
};
var id = 1234567;
function addProdToTrolley(e) {
YAHOO.util.Event.stopEvent(e);
document.trolley_quantity.trolley_action.value = 'Add to cart';
document.trolley_quantity.tid.value = '';
document.trolley_quantity.prod_id.value = id;
disableButtons();
document.trolley_quantity.submit();
};
</script>
You can use a regex to extract the script contents from the HTML content without the XML DOM:
preg_match_all('/<script\b[^>]*>(.*?)<\/script>/is', $html, $matches);
Then use substr to take the part that starts with SHOP.Purchase.buy( and runs up to the closing ;}); of the call.
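The substr approach can be sketched roughly as follows. This is a minimal sketch, not a definitive implementation: the $script string is a hypothetical stand-in for one script body taken from the sample HTML, the variable names are illustrative, and it assumes the call appears once and is closed by );}); as in the sample.

```php
<?php
// Hypothetical stand-in for the contents of the <script> element.
$script = <<<'JS'
YAHOO.util.Event.on(['buy_now','buy_now_sub'], 'click', function() { SHOP.Purchase.buy(1234567, 111, 'Red Pants', 'https://example.com/red', '', 1, '','','0','0','0', 'buy');});
JS;

$startMarker = 'SHOP.Purchase.buy(';
$endMarker   = ');});';   // assumption: the call is closed exactly like this

$args  = null;
$start = strpos($script, $startMarker);
if ($start !== false) {
    $start += strlen($startMarker);              // move past the opening "("
    $end = strpos($script, $endMarker, $start);  // first closing marker after the call
    if ($end !== false) {
        // Everything between the parentheses of the call.
        $args = substr($script, $start, $end - $start);
    }
}

var_dump($args);
```

Here $args holds the argument list without the surrounding parentheses, and stays null if either marker is not found.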
Alternatively, use a regex directly to extract the JS data (the dots are escaped so that they match literal dots only):
preg_match_all('/SHOP\.Purchase\.buy(.*?);}\);/is', $string, $matches);
Now $matches[1][0] holds the data extracted from inside the script:
(1234567, 111, 'Red Pants', 'https://example.com/red', '', 1, '','','0','0','0', 'buy')
You can use preg_match instead of preg_match_all if you only want the first match.
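Since the question asks for a PHP array, one possible way to convert the captured text is sketched below. It is only a sketch under stated assumptions: the regex here moves the parentheses outside the capture group (unlike the pattern above), $script is a hypothetical stand-in for the script body, and the JSON conversion only works when no value contains a single or double quote, which holds for the sample but may not hold for real pages.

```php
<?php
// Hypothetical stand-in for the contents of the <script> element.
$script = <<<'JS'
YAHOO.util.Event.on(['buy_now','buy_now_sub'], 'click', function() { SHOP.Purchase.buy(1234567, 111, 'Red Pants', 'https://example.com/red', '', 1, '','','0','0','0', 'buy');});
JS;

$items = [];
// Capture only the text between the parentheses of the call.
if (preg_match('/SHOP\.Purchase\.buy\((.*?)\);}\);/s', $script, $m)) {
    // Assumption: values contain no quote characters, so swapping
    // single quotes for double quotes yields a valid JSON array body.
    $json    = '[' . str_replace("'", '"', $m[1]) . ']';
    $decoded = json_decode($json, true);
    if (is_array($decoded)) {
        $items = $decoded;
    }
}

print_r($items);
```

With this approach the unquoted numbers come back as integers and the quoted values as strings. If the values can contain quotes or commas, a proper JavaScript tokenizer would be a safer choice than this string replacement.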