LeetCode 1410. HTML Entity Parser (hash map)

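1. Problem

An "HTML entity parser" is a special parser that takes HTML code as input and replaces every special character entity with the character it stands for.

The special characters in HTML and their corresponding entities are: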

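Quotation mark: the entity is &quot; and the character is " .
Single quote mark: the entity is &apos; and the character is ' .
Ampersand: the entity is &amp; and the character is & .
Greater-than sign: the entity is &gt; and the character is > .
Less-than sign: the entity is &lt; and the character is < .
Slash: the entity is &frasl; and the character is / .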


Given the input string text, implement an HTML entity parser and return the text after the parser has replaced the entities.

Example 1:
Input: text = "&amp; is an HTML entity but &ambassador; is not."
Output: "& is an HTML entity but &ambassador; is not."
Explanation: the parser replaces the entity &amp; with &. "&ambassador;" is not a known entity, so it is left unchanged.

Example 2:
Input: text = "and I quote: &quot;...&quot;"
Output: "and I quote: \"...\""

Example 3:
Input: text = "Stay home! Practice on Leetcode :)"
Output: "Stay home! Practice on Leetcode :)"

Example 4:
Input: text = "x &gt; y &amp;&amp; x &lt; y is always false"
Output: "x > y && x < y is always false"

Example 5:
Input: text = "leetcode.com&frasl;problemset&frasl;all"
Output: "leetcode.com/problemset/all"
 
Constraints:
1 <= text.length <= 10^5
The string may contain any of the 256 ASCII characters.


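Source: LeetCode (力扣)

Link: https://leetcode-cn.com/problems/html-entity-parser

Copyright belongs to LeetCode (领扣网络). For commercial reprints, contact LeetCode for authorization; for non-commercial reprints, credit the source.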

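2. Solution

Scan text from left to right. On '&', start accumulating characters, and stop at the closing ';'. Then look the accumulated token up in the hash table: if it is a known entity, append its replacement character; otherwise append the token unchanged.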
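A single pass matters here. As a minimal sketch of the pitfall, consider a naive decoder that runs one replace-all per entity. The names replaceAll and naiveDecode are hypothetical helpers written for this illustration, not part of the solution. Because &amp; is replaced first, the input "&amp;gt;" becomes "&gt;" after the first pass and is then decoded a second time into ">", whereas the correct result is "&gt;".

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical helper: replace every occurrence of `from` in `s` with `to`.
std::string replaceAll(std::string s, const std::string& from, const std::string& to) {
    std::size_t pos = 0;
    while ((pos = s.find(from, pos)) != std::string::npos) {
        s.replace(pos, from.size(), to);
        pos += to.size();  // continue after the text just inserted
    }
    return s;
}

// Naive multi-pass decoder, shown only to illustrate the pitfall:
// each pass rescans the output of the previous pass.
std::string naiveDecode(std::string text) {
    const std::vector<std::pair<std::string, std::string>> table = {
        {"&amp;", "&"},   {"&gt;", ">"},   {"&lt;", "<"},
        {"&quot;", "\""}, {"&apos;", "'"}, {"&frasl;", "/"}};
    for (const auto& entry : table)
        text = replaceAll(text, entry.first, entry.second);
    return text;
}
```

With this ordering, naiveDecode("&amp;gt;") yields ">" because the text is decoded twice. The single-pass solution that follows never rescans its own output, so it does not have this problem.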

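#include <string>
#include <unordered_map>
using namespace std;

class Solution {
public:
    string entityParser(string text) {
        // Lookup table: entity -> replacement character.
        unordered_map<string,string> m = {{"&quot;","\""},{"&apos;","'"},{"&amp;","&"},
            {"&gt;",">"},{"&lt;","<"},{"&frasl;","/"}};
        string word;  // candidate entity, always starting with '&'
        string ans;
        for(size_t i = 0; i < text.size(); ++i)
        {
            if(text[i] != '&')
                ans += text[i];
            else
            {
                word = "&";
                // Accumulate up to and including ';', but stop before another '&':
                // that '&' may start a real entity (e.g. "&&gt;" must give "&>").
                while(i + 1 < text.size() && text[i+1] != '&')
                {
                    word += text[++i];
                    if(text[i] == ';')
                        break;
                }
                // Known entity: append its character; otherwise keep the token as-is.
                auto it = m.find(word);
                ans += (it != m.end()) ? it->second : word;
            }
        }
        return ans;
    }
};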


Reported LeetCode result: 288 ms, 19.2 MB
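As a possible variation on the hash-map version, the sketch below avoids building a temporary token: at each '&' it compares the upcoming characters directly against the six known entities. The function name parseEntities and the table name kEntities are assumptions made for this sketch. It has not been timed on LeetCode, so any speed difference from the result above is untested.

```cpp
#include <string>
#include <utility>
#include <vector>

// Hypothetical alternative: match entities in place with a bounded look-ahead.
std::string parseEntities(const std::string& text) {
    static const std::vector<std::pair<std::string, char>> kEntities = {
        {"&quot;", '"'}, {"&apos;", '\''}, {"&amp;", '&'},
        {"&gt;", '>'},   {"&lt;", '<'},    {"&frasl;", '/'}};

    std::string out;
    out.reserve(text.size());  // the output is never longer than the input
    std::size_t i = 0;
    while (i < text.size()) {
        bool matched = false;
        if (text[i] == '&') {
            for (const auto& e : kEntities) {
                // compare() clamps the length at the end of the string, so an
                // entity cut off by the end of the input simply fails to match.
                if (text.compare(i, e.first.size(), e.first) == 0) {
                    out += e.second;
                    i += e.first.size();
                    matched = true;
                    break;
                }
            }
        }
        // Ordinary character, or an '&' that does not start a known entity.
        if (!matched) out += text[i++];
    }
    return out;
}
```

Since each position is compared against at most six short strings, the scan stays linear in the length of text, and an '&' that does not start an entity is copied as a plain character, so a following entity such as the one in "&&gt;" is still recognized.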