There isn't really a "raw byte form". The characters are encoded somehow, in all forms.
http://en.wikipedia.org/wiki/Charact...odings_in_HTML

If you want to go back to a codepage encoded string:
Code:
CStringA cp_text = CW2AEX<>(wide_text, cp);
gg