When I pars xml (scraping Google RSS) national symbol (Cyrillic) is breaking:
>xml <- xmlTreeParse(url, useInternalNodes = T)
>xml
<? xml version="1.0" encoding="UTF‑8"?>
<rss version="2.0">
<channel>
<generator>NFE/1.0</generator>
<title>югра OR ханты OR хмао – Новости Google</title>
…