用PHP抓取百度贴吧邮箱数据 phpsay贴吧系统 php贴吧开源系统 特战旅php培训贴

php中文网
发布: 2016-07-29 08:53:55
原创
1294人浏览过

注:本程序可能非常适合那些做百度贴吧营销的朋友。

去逛百度贴吧的时候,经常会看到楼主分享一些资源,要求留下邮箱,楼主才给发。

对于一个热门的帖子,留下的邮箱数量是非常多的,楼主需要一个一个的去复制那些回复的邮箱,然后再粘贴发送邮件,不是被折磨死就是被累死。无聊至极写了一个抓取百度贴吧邮箱数据的程序,需要的拿走。

程序实现了一键抓取帖子全部邮箱和分页抓取邮箱两个功能,界面懒得做了,效果如下:

仿百度贴吧php源码,php模拟登录百度贴吧,仿百度贴吧php,php贴吧,php贴吧源码,php贴吧程序,phpsay贴吧系统,php贴吧开源系统,特战旅php培训贴

立即学习PHP免费学习笔记(深入)”;

老规矩,直接贴源码

<?<span>php
</span><span>$url2</span>=""<span>;
</span><span>$page</span>=""<span>;
</span><span>if</span>(<span>$_GET</span>['url2']==""<span>){
    </span><span>$url2</span>="http://tieba.baidu.com/p/2314539885?pn=1"<span>;
}</span><span>else</span><span>{
    </span><span>$url2</span>=<span>$_GET</span>['url2'<span>];
}

</span><span>if</span>(<span>$_GET</span>['page']==""<span>){
    </span><span>$page</span>="1"<span>;
}</span><span>else</span><span>{
    </span><span>$page</span>=<span>$_GET</span>['page'<span>];
}
</span>?>
<form action="" method="get">
<input type="hidden" value="getAll" name="type" />
<table>
    <tr>
        <td>帖子链接:</td><td><input type="text" name="url" value="http://tieba.baidu.com/p/2314539885" /></td>
    </tr>
    <tr>
        <td>总页数:</td><td><input type="text" name="page" value="<?php echo <span>$page</span>;?>" /></td>
    </tr>
    <tr>
        <td colspan=2><input type="submit" value="抓取全部邮箱数据" /></td>
    </tr>
</table>
</form>

<form action="" method="get">
<input type="hidden" value="getNow" name="type" />
<table>
    <tr>
        <td>帖子链接:</td><td><input type="text" name="url2" value="<?php echo <span>$url2</span>;?>" /></td>
    </tr>
    <tr>
        <td colspan=2><input type="submit" value="抓取当前页面邮箱数据" /></td>
    </tr>
</table>
</form>
<?<span>php
</span><span>if</span>(<span>$_GET</span>['type']!=""<span>){
    </span><span>$counts</span>=0<span>;
    </span><span>if</span>(<span>$_GET</span>['type']=="getAll"<span>){
        </span><span>$pages</span>=<span>$_GET</span>['page'<span>];
        </span><span>$url</span> = <span>$_GET</span>['url'<span>];
        </span><span>for</span>(<span>$i</span>=0;<span>$i</span><<span>$pages</span>;<span>$i</span>++<span>){
            </span><span>$ch2</span> =<span> curl_init();
            curl_setopt(</span><span>$ch2</span>, CURLOPT_URL, <span>$url</span><span>);
            curl_setopt(</span><span>$ch2</span>, CURLOPT_FOLLOWLOCATION, <span>TRUE</span><span>);
            curl_setopt(</span><span>$ch2</span>, CURLOPT_SSL_VERIFYHOST, <span>FALSE</span><span>);
            curl_setopt(</span><span>$ch2</span>, CURLOPT_SSL_VERIFYPEER, <span>false</span><span>);
            curl_setopt(</span><span>$ch2</span>, CURLOPT_RETURNTRANSFER, <span>TRUE</span><span>);
            </span><span>$texts</span> = curl_exec(<span>$ch2</span><span>);
            curl_close(</span><span>$ch2</span><span>);
            </span><span>$dat</span>=getEmail(<span>$texts</span><span>);
            </span><span>for</span>(<span>$j</span>=0;<span>$j</span><<span>count</span>(<span>$dat</span>);<span>$j</span>++<span>){
                </span><span>echo</span><span>$dat</span>[<span>$j</span>]."<br />"<span>;
                </span><span>$counts</span>++<span>;
            }
        }
    }</span><span>else</span><span>if</span>(<span>$_GET</span>['type']=="getNow"<span>){
        </span><span>$url</span> = <span>$_GET</span>['url2'<span>];
        </span><span>$ch2</span> =<span> curl_init();
        curl_setopt(</span><span>$ch2</span>, CURLOPT_URL, <span>$url</span><span>);
        curl_setopt(</span><span>$ch2</span>, CURLOPT_FOLLOWLOCATION, <span>TRUE</span><span>);
        curl_setopt(</span><span>$ch2</span>, CURLOPT_SSL_VERIFYHOST, <span>FALSE</span><span>);
        curl_setopt(</span><span>$ch2</span>, CURLOPT_SSL_VERIFYPEER, <span>false</span><span>);
        curl_setopt(</span><span>$ch2</span>, CURLOPT_RETURNTRANSFER, <span>TRUE</span><span>);
        </span><span>$texts</span> = curl_exec(<span>$ch2</span><span>);
        curl_close(</span><span>$ch2</span><span>);
        </span><span>$dat</span>=getEmail(<span>$texts</span><span>);
        </span><span>for</span>(<span>$i</span>=0;<span>$i</span><<span>count</span>(<span>$dat</span>);<span>$i</span>++<span>){
            </span><span>echo</span><span>$dat</span>[<span>$i</span>]."<br />"<span>;
            </span><span>$counts</span>++<span>;
        }
    }
    </span><span>echo</span> '<h2>共采集到数据:'.<span>$counts</span>.'条</h2>'<span>;
}
</span><span>function</span> getEmail(<span>$str</span><span>){
    </span><span>$pattern</span> = "/([a-z0-9\-_\.]+@[a-z0-9]+\.[a-z0-9\-_\.]+)/"<span>; 
    </span><span>preg_match_all</span>(<span>$pattern</span>,<span>$str</span>,<span>$emailArr</span><span>); 
    </span><span>return</span><span>$emailArr</span>[0<span>]; 
}
</span>?>
登录后复制

以上就介绍了用PHP抓取百度贴吧邮箱数据,包括了百度贴吧,php方面的内容,希望对PHP教程有兴趣的朋友有所帮助。

PHP速学教程(入门到精通)
PHP速学教程(入门到精通)

PHP怎么学习?PHP怎么入门?PHP在哪学?PHP怎么学才快?不用担心,这里为大家提供了PHP速学教程(入门到精通),有需要的小伙伴保存下载就能学习啦!

下载
来源:php中文网
本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系admin@php.cn
最新问题
开源免费商场系统广告
热门教程
更多>
最新下载
更多>
网站特效
网站源码
网站素材
前端模板
关于我们 免责申明 意见反馈 讲师合作 广告合作 最新更新
php中文网:公益在线php培训,帮助PHP学习者快速成长!
关注服务号 技术交流群
PHP中文网订阅号
每天精选资源文章推送
PHP中文网APP
随时随地碎片化学习
PHP中文网抖音号
发现有趣的

Copyright 2014-2025 https://www.php.cn/ All Rights Reserved | php.cn | 湘ICP备2023035733号