a new way to anti scrapping
- 1 minutes read - 143 wordsI loved to read novels. Recently I found that several sites show some garbed characters in square boxes. I went to the original site qidian and those chacters are shown correctly in the orignal site. After little investigation, I found the following information;
In above pics, 了’s HTML entity code is , it is not as 天.
How does chrome show the characters correctly? What’s the magic? Initially I thougt it was similar as o’reilly. I set break on subtree modification in chrome to locate the code. However there is no luck. Refreshing the webpage,the content of the page is same as the one from server. Something I missed ? After close look, I noticed there is a style elment before the <section class="read-section jsChapterWrapper j_581616852". Expanded the style element. Voilà, the magic is here. It use Embedded OpenType to implement that. https://en.wikipedia.org/wiki/Embedded_OpenType