62,046
社区成员
发帖
与我相关
我的任务
分享
<h3\sclass="c-title"><a\shref="(?<Url>[^"]*)[^>]*>(?<Title>.*?)</a>
每页有20个,前10个自行在程序中计数搞定。
WebClient wc = new WebClient();
string html = Encoding.UTF8.GetString(wc.DownloadData("http://news.baidu.com/ns?from=news&cl=2&bt=0&y0=2013&m0=11&d0=6&y1=2013&m1=11&d1=6&et=0&q1=%B1%B1%BE%A9&submit=%B0%D9%B6%C8%D2%BB%CF%C2&q3=&q4=&s=1&mt=0&lm=0&begin_date=2013-11-6&end_date=2013-11-6&tn=newsdy&ct1=1&ct=1&rn=20&q6="));
MatchCollection mc = Regex.Matches(html, "(?is)class=\"c-title\".*?href=\"(.*?)\".*?data-click=.*?>(.*?)</a>", RegexOptions.Singleline);
int i = 0;
foreach (Match match in mc)
{
i++;
if (i > 10) break;
Console.WriteLine(match.Groups[1].Value + " " + match.Groups[2].Value);
}