110,499
社区成员
发帖
与我相关
我的任务
分享
<li>
<h4><a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-c010d1d1e4bd4480741f6e84e41e6abe.htm'>LEVIS 夏款个性印花短袖T恤 M号咖啡色</a></h4>
<div class="item">
<div class="pic">
<a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-c010d1d1e4bd4480741f6e84e41e6abe.htm'>
<img src="http://img05.taobaocdn.com/bao/uploaded/i5/T1N2phXoEa.0JDtMIT_011723.jpg_160x160.jpg" />
</a>
</div>
<div class="desc">
<a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-c010d1d1e4bd4480741f6e84e41e6abe.htm' class="permalink"> LEVIS 夏款个性印花短袖T恤 M号咖啡色 </a>
</div>
<div class="price">
<span>一口价</span><strong>45.00元</strong>
</div>
<div class="remain-date">剩余 6天</div>
</div>
</li>
<li>
<h4><a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-6e41151338a1d0e920a7c71ee7ca2042.htm'>Levis 可爱公仔BE@RBRICK熊暴力熊 联名T恤 019黑色XL号</a></h4>
<div class="item">
<div class="pic">
<a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-6e41151338a1d0e920a7c71ee7ca2042.htm'>
<img src="http://img06.taobaocdn.com/bao/uploaded/i6/T1wvthXkrV.0K3gLw._112210.jpg_160x160.jpg" />
</a>
</div>
<div class="desc">
<a target="_blank" href='http://item.taobao.com/auction/item_detail-0db1-6e41151338a1d0e920a7c71ee7ca2042.htm' class="permalink"> Levis 可爱公仔BE@RBRICK熊暴力熊 联名T恤 019黑色XL号 </a>
</div>
<div class="price">
<span>一口价</span><strong>40.00元</strong>
</div>
<div class="remain-date">剩余 6天</div>
</div>
</li>
这样的正则要是写起来太长了,差不多吧HTML写了一遍
<div class=""remain-date"">(.*?)</div>
<strong>(.*?)</strong>
<span>.*?</span>
<img[^>]*src=(""(?<src>[^""]*)""|'(?<src>[^']*)'|(?<src>[^\s>]*))[^>]*>
string s = "..";
Regex re = new Regex(@"<a[^>]*href=(""(?<href>[^""]*)""|'(?<href>[^']*)'|(?<href>[^\s>]*))[^>]*>(?<text>.*?)</a>", RegexOptions.IgnoreCase | RegexOptions.Singleline);
Match m = re.Match(s);
if(m.Success)
{
string link = m.Groups["href"].Value;
string text = Regex.Replace(m.Groups["text"].Value,"<[^>]*>","");
Console.WriteLine("link:{0}\ntext:{1}", link, text);
}