[CSharp] WinForm: fetching the titles of web pages one by one
Author: Dezai.CN / Published 2011/9/21
using System;
using System.Net;
using System.Text;
using System.Text.RegularExpressions;

class Program
{
    // Get a page's HTML, detecting the Encoding from the page's charset declaration.
    static string GetHtml(string url)
    {
        return GetHtml(url, null);
    }

    // Get a page's HTML using the given Encoding (null means auto-detect).
    static string GetHtml(string url, Encoding encoding)
    {
        byte[] buf;
        using (WebClient client = new WebClient())
        {
            buf = client.DownloadData(url);
        }
        if (encoding != null) return encoding.GetString(buf);

        // Decode as UTF-8 first so that the charset declaration can be read.
        string html = Encoding.UTF8.GetString(buf);
        encoding = GetEncoding(html);

        // Compare code pages: reference equality with Encoding.UTF8 is not guaranteed.
        if (encoding == null || encoding.CodePage == Encoding.UTF8.CodePage) return html;
        return encoding.GetString(buf);
    }

    // Extract a page's Encoding from its HTML; returns null if no usable charset is found.
    static Encoding GetEncoding(string html)
    {
        string pattern = @"(?i)\bcharset=(?<charset>[-a-zA-Z_0-9]+)";
        string charset = Regex.Match(html, pattern).Groups["charset"].Value;
        try { return Encoding.GetEncoding(charset); }
        catch (ArgumentException) { return null; }
    }

    // Extract a page's title from its HTML.
    static string GetTitle(string html)
    {
        string pattern = @"(?si)<title(?:\s+(?:""[^""]*""|'[^']*'|[^""'>])*)?>(?<title>.*?)</title>";
        return Regex.Match(html, pattern).Groups["title"].Value.Trim();
    }

    // Print a page's Encoding and title.
    static void PrintEncodingAndTitle(string url)
    {
        string html = GetHtml(url);
        Console.WriteLine("[{0}] [{1}]", GetEncoding(html), GetTitle(html));
    }

    // Program entry point.
    static void Main()
    {
        PrintEncodingAndTitle("http://www.msdn.net/");
        PrintEncodingAndTitle("http://www.cnblogs.com/");
        PrintEncodingAndTitle("http://www.cnblogs.com/skyiv/");
        PrintEncodingAndTitle("http://www.csdn.net/");
        PrintEncodingAndTitle("http://news.163.com/");
    }
}

/* Program output:
[] [MSDN: Microsoft Developer Network]
[System.Text.UTF8Encoding] [博客园 - 程序员的网上家园]
[System.Text.UTF8Encoding] [空间/IV - 博客园]
[System.Text.UTF8Encoding] [CSDN.NET - 中国最大的IT技术社区,为IT专业技术人员提供最全面的信息传播和服务平台]
[System.Text.DBCSCodePageEncoding] [新闻中心_网易新闻]
*/