2019-10-29 16:06:05 3532次浏览 3条回答 0 悬赏 10 金钱

大家好,请问一下 PHP curl 为什么不能抓取微信公众号的文章?

<?php

//搜狗抓取微信公众号
$url="https://mp.weixin.qq.com/s?src=11&timestamp=1572321601&ver=1941&signature=nDP0iANU0l5g-Iy8uGpL*FUpa*DQGdh2RIIJ5cS672deXdCg2MoTcZRgAd1QlCLfYGl8JKPuw8a09JErds8EU65GdxhUP-sxY1iaZsTw1UrcasaYwpPE7MlGZJNo3KNN&new=1";
$ifpost = 0;
$datafields = '';
$cookiefile = '';
$v = false;
//构造随机ip
$ip_long = array(
    array('607649792', '608174079'), //36.56.0.0-36.63.255.255
    array('1038614528', '1039007743'), //61.232.0.0-61.237.255.255
    array('1783627776', '1784676351'), //106.80.0.0-106.95.255.255
    array('2035023872', '2035154943'), //121.76.0.0-121.77.255.255
    array('2078801920', '2079064063'), //123.232.0.0-123.235.255.255
    array('-1950089216', '-1948778497'), //139.196.0.0-139.215.255.255
    array('-1425539072', '-1425014785'), //171.8.0.0-171.15.255.255
    array('-1236271104', '-1235419137'), //182.80.0.0-182.92.255.255
    array('-770113536', '-768606209'), //210.25.0.0-210.47.255.255
    array('-569376768', '-564133889'), //222.16.0.0-222.95.255.255
);
$rand_key = mt_rand(0, 9);
$ip= long2ip(mt_rand($ip_long[$rand_key][0], $ip_long[$rand_key][1]));
//模拟http请求header头
$header = array("Connection: Keep-Alive","Accept: text/html, application/xhtml+xml, */*", "Pragma: no-cache", "Accept-Language: zh-Hans-CN,zh-Hans;q=0.8,en-US;q=0.5,en;q=0.3","User-Agent: Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.2; WOW64; Trident/6.0)",'CLIENT-IP:'.$ip,'X-FORWARDED-FOR:'.$ip);
$ch = curl_init();
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_HEADER, $v);
curl_setopt($ch, CURLOPT_HTTPHEADER, $header);
$ifpost && curl_setopt($ch, CURLOPT_POST, $ifpost);
$ifpost && curl_setopt($ch, CURLOPT_POSTFIELDS, $datafields);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
$cookiefile && curl_setopt($ch, CURLOPT_COOKIEFILE, $cookiefile);
$cookiefile && curl_setopt($ch, CURLOPT_COOKIEJAR, $cookiefile);
curl_setopt($ch,CURLOPT_TIMEOUT,30); //允许执行的最长秒数
$ok = curl_exec($ch);
curl_close($ch);
unset($ch);
//正则匹配出微信名称
preg_match_all('|<label name="em_weixinhao">(.*?)<\/label>|is',$ok,$m);
var_dump($m);

最佳答案

  • 回答于 2019-10-30 09:09 举报

    你这不是废物测试吗,直接扔代码出来,看是你写错了,还是PHPcrul不能抓

    1 条回复
    回复于 2019-10-30 09:49 回复
    <?php
    
    //搜狗抓取微信公众号
    $url="https://mp.weixin.qq.com/s?src=11&timestamp=1572321601&ver=1941&signature=nDP0iANU0l5g-Iy8uGpL*FUpa*DQGdh2RIIJ5cS672deXdCg2MoTcZRgAd1QlCLfYGl8JKPuw8a09JErds8EU65GdxhUP-sxY1iaZsTw1UrcasaYwpPE7MlGZJNo3KNN&new=1";
    $ifpost = 0;
    $datafields = '';
    $cookiefile = '';
    $v = false;
    //构造随机ip
    $ip_long = array(
        array('607649792', '608174079'), //36.56.0.0-36.63.255.255
        array('1038614528', '1039007743'), //61.232.0.0-61.237.255.255
        array('1783627776', '1784676351'), //106.80.0.0-106.95.255.255
        array('2035023872', '2035154943'), //121.76.0.0-121.77.255.255
        array('2078801920', '2079064063'), //123.232.0.0-123.235.255.255
        array('-1950089216', '-1948778497'), //139.196.0.0-139.215.255.255
        array('-1425539072', '-1425014785'), //171.8.0.0-171.15.255.255
        array('-1236271104', '-1235419137'), //182.80.0.0-182.92.255.255
        array('-770113536', '-768606209'), //210.25.0.0-210.47.255.255
        array('-569376768', '-564133889'), //222.16.0.0-222.95.255.255
    );
    $rand_key = mt_rand(0, 9);
    $ip= long2ip(mt_rand($ip_long[$rand_key][0], $ip_long[$rand_key][1]));
    //模拟http请求header头
    $header = array("Connection: Keep-Alive","Accept: text/html, application/xhtml+xml, */*", "Pragma: no-cache", "Accept-Language: zh-Hans-CN,zh-Hans;q=0.8,en-US;q=0.5,en;q=0.3","User-Agent: Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.2; WOW64; Trident/6.0)",'CLIENT-IP:'.$ip,'X-FORWARDED-FOR:'.$ip);
    $ch = curl_init();
    curl_setopt($ch, CURLOPT_URL, $url);
    curl_setopt($ch, CURLOPT_HEADER, $v);
    curl_setopt($ch, CURLOPT_HTTPHEADER, $header);
    $ifpost && curl_setopt($ch, CURLOPT_POST, $ifpost);
    $ifpost && curl_setopt($ch, CURLOPT_POSTFIELDS, $datafields);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);
    curl_setopt($ch, CURLOPT_FOLLOWLOCATION, true);
    $cookiefile && curl_setopt($ch, CURLOPT_COOKIEFILE, $cookiefile);
    $cookiefile && curl_setopt($ch, CURLOPT_COOKIEJAR, $cookiefile);
    curl_setopt($ch,CURLOPT_TIMEOUT,30); //允许执行的最长秒数
    $ok = curl_exec($ch);
    curl_close($ch);
    unset($ch);
    //正则匹配出微信名称
    preg_match_all('|<label name="em_weixinhao">(.*?)<\/label>|is',$ok,$m);
    var_dump($m);
    
  • 回答于 2019-10-30 15:00 举报

    chrome扩展了解下,万能抓

您需要登录后才可以回答。登录 | 立即注册
PHP学院的中学生
副总裁

PHP学院的中学生

注册时间:2018-10-23
最后登录:2025-03-24
在线时长:168小时46分
  • 粉丝29
  • 金钱4730
  • 威望30
  • 积分6710

热门问题