C# 在字符串中插入字符串排除Html算法

self001 2014-05-20 09:42:23

这个应该怎么写,,有没有大神帮个忙,没思路啊

...全文

176 7 打赏收藏转发到动态举报

写回复

用AI写文章

7 条回复

切换为时间正序

请发表友善的回复…

发表回复

jimil 2014-05-23

打赏
举报

我想你的目的，应该2楼已经给出了方法，只不过这些正则你得自己完成，然后replace加上你需要插入的内容。

self001 2014-05-23

打赏
举报

引用 1 楼 zhouxiulu 的回复:

你说什么排除HTML啊?是不要HTML中的各个标签吗?如果是这个的话用Replace就可以了

是的，不过插入了相关的字符串后得替换回来。不能插入在html中。。

self001 2014-05-21

打赏
举报

谢谢大家，可能是理解错了。不只是要排除html，还要插入内容，内容不能插入在html标签内，是这个意思。。

蝶恋花雨 2014-05-21

打赏
举报

#region 过滤html,js,css代码
    /// <summary>
    /// 过滤html,js,css代码
    /// </summary>
    /// <param name="html">参数传入</param>
    /// <returns></returns>
    public static string CheckStr(string html)
    {
        System.Text.RegularExpressions.Regex regex1 = new System.Text.RegularExpressions.Regex(@"<script[\s\S]+</script *>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex2 = new System.Text.RegularExpressions.Regex(@" href *= *[\s\S]*script *:", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex3 = new System.Text.RegularExpressions.Regex(@" no[\s\S]*=", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex4 = new System.Text.RegularExpressions.Regex(@"<iframe[\s\S]+</iframe *>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex5 = new System.Text.RegularExpressions.Regex(@"<frameset[\s\S]+</frameset *>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex6 = new System.Text.RegularExpressions.Regex(@"\<img[^\>]+\>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex7 = new System.Text.RegularExpressions.Regex(@"</p>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex8 = new System.Text.RegularExpressions.Regex(@"<p>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex9 = new System.Text.RegularExpressions.Regex(@"<[^>]*>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex10 = new System.Text.RegularExpressions.Regex(@" ", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex11 = new System.Text.RegularExpressions.Regex(@">", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex12 = new System.Text.RegularExpressions.Regex(@"<", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        html = regex1.Replace(html, ""); //过滤<script></script>标记 
        html = regex2.Replace(html, ""); //过滤href=javascript: (<A>) 属性 
        html = regex3.Replace(html, " _disibledevent="); //过滤其它控件的on...事件 
        html = regex4.Replace(html, ""); //过滤iframe 
        html = regex5.Replace(html, ""); //过滤frameset 
        html = regex6.Replace(html, ""); //过滤frameset 
        html = regex7.Replace(html, ""); //过滤frameset 
        html = regex8.Replace(html, ""); //过滤frameset 
        html = regex9.Replace(html, "");
        html = regex10.Replace(html, "");//过滤空格
        html = regex11.Replace(html, "");
        html = regex12.Replace(html, "");
        html = html.Replace(" ", "");
        html = html.Replace("</strong>", "");
        html = html.Replace("<strong>", "");
        return html;
    }
    #endregion


    #region//过滤简单的HTML代码去
    /// <summary>
    /// 过滤简单的HTML代码去 图片空格。之类
    /// </summary>
    public static string Subfilter(string html)
    {
        System.Text.RegularExpressions.Regex regex1 = new System.Text.RegularExpressions.Regex(@"\<img[^\>]+\>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex2 = new System.Text.RegularExpressions.Regex(@"</p>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex3 = new System.Text.RegularExpressions.Regex(@"<p>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex4 = new System.Text.RegularExpressions.Regex(@"<[^>]*>", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex5 = new System.Text.RegularExpressions.Regex(@" ", System.Text.RegularExpressions.RegexOptions.IgnoreCase);
        System.Text.RegularExpressions.Regex regex6 = new System.Text.RegularExpressions.Regex(@">", System.Text.RegularExpressions.RegexOptions.IgnoreCase);

        html = regex1.Replace(html, ""); //过滤frameset 
        html = regex2.Replace(html, ""); //过滤frameset 
        html = regex3.Replace(html, ""); //过滤frameset 
        html = regex4.Replace(html, "");
        html = regex5.Replace(html, "");//过滤空格
        html = regex6.Replace(html, "");
        html = html.Replace(" ", "");
        html = html.Replace("</strong>", "");
        html = html.Replace("<strong>", "");
        return html;
    }
    #endregion

    #region//去除HTML标记用于过滤掉FckEditor中的HTML标记
    /// <param name="strHtml">包括HTML的源码 </param>
    /// <returns>已经去除后的文字</returns>
    public static string StripHTML(string strHtml)
    {
        string[] aryReg ={
          @"<script[^>]*?>.*?</script>",

          @"<(\/\s*)?!?((\w+:)?\w+)(\w+(\s*=?\s*(([""'])(\\[""'tbnr]|[^\7])*?\7|\w+)|.{0})|\s)*?(\/\s*)?>",
          @"([\r\n])[\s]+",
          @"&(quot|#34);",
          @"&(amp|#38);",
          @"&(lt|#60);",
          @"&(gt|#62);",
          @"&(nbsp|#160);",
          @"&(iexcl|#161);",
          @"&(cent|#162);",
          @"&(pound|#163);",
          @"&(copy|#169);",
          @"&#(\d+);",
          @"-->",
          @"<!--.*\n"
        
         };

        string[] aryRep = {
           "",
           "",
           "",
           "\"",
           "&",
           "<",
           ">",
           " ",
           "\xa1",//chr(161),
           "\xa2",//chr(162),
           "\xa3",//chr(163),
           "\xa9",//chr(169),
           "",
           "\r\n",
           ""
          };

        string newReg = aryReg[0];
        string strOutput = strHtml;
        for (int i = 0; i < aryReg.Length; i++)
        {
            Regex regex = new Regex(aryReg[i], RegexOptions.IgnoreCase);
            strOutput = regex.Replace(strOutput, aryRep[i]);
        }

        strOutput.Replace("<", "");
        strOutput.Replace(">", "");
        strOutput.Replace("\r\n", "");
        strOutput.Replace(" ", " ");

        return strOutput;
    }
    #endregion

三个你随便选一个

郑州高新区WPF小王子 2014-05-21

打赏
举报

去掉 html 标签？？使用replace方法。

bwangel 2014-05-20

打赏
举报

var htmlStr = "<body> <div>aaa </div></body>";

var result = Regex.Replace("<[^>]*>", "");

zhouxiulu 2014-05-20

打赏
举报

你说什么排除HTML啊?是不要HTML中的各个标签吗?如果是这个的话用Replace就可以了

6. **字符串加密**：在C#中，可以使用System.Security.Cryptography命名空间下的类进行字符串加密。例如，AES（高级加密标准）、RSA（公钥加密算法）和SHA（安全散列算法）等，这些加密算法可用于保护敏感信息的安全...

在.NET框架中，C#语言提供了丰富的类库，使得开发者能够高效地进行各种操作。"C#常用类库(100多个)"这个资源包涵盖了众多实用的编程领域，包括文件处理、网络通信、HTTP交互、多线程、UI控件、Office文档操作、输入/...

1. **字符艺术**：源码中可能包含了如何用C#语言生成字符图像的算法，这可能涉及到字符串处理、循环结构以及条件判断等基本编程概念。 2. **控制台输出**：由于字符图片通常在控制台上呈现，开发者可能使用了C#的`...

首先，在C#中，我们可以通过字符串的ToCharArray()方法将字符串转换为字符数组，然后使用LINQ中的Distinct()方法来去除重复元素，并再次转换为字符串。对于字符串中的重复字符，我们可以通过简单的算法来进行去重...

该代码接受一个字符串参数，将其压缩后返回一个新的字符串。接着，我们遍历输入字符串中的每个字符，同时使用...以上就是使用C#实现字符串压缩算法的方法，你可以将其应用于你的代码中以提高程序的性能和减少存储空间。

111,098

社区成员

642,554

社区内容

发帖

与我相关

我的任务

社区管理员

加入社区

近7日
近30日
至今

加载中

查看更多榜单

社区公告

让您成为最强悍的C#开发者

试试用AI创作助手写篇文章吧

+ 用AI写文章