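I have an RTF file that I need to read and parse. Because the file contains images, it also contains some special characters. When I read all the text from the file, I cannot read anything that comes after those special characters.
I tried reading the file using ReadAllText with Encoding.UTF8 and with Encoding.ASCII.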
    using System.IO;
    using System.Text;

    public class ReadFile
    {
        public static string GetFileContent(string path)
        {
            if (!File.Exists(path))
            {
                throw new FileNotFoundException("File not found.", path);
            }

            // I also tried:
            // return File.ReadAllText(path, Encoding.ASCII);
            var text = new StringBuilder();
            using (var fileStream = new FileStream(path, FileMode.Open, FileAccess.Read))
            using (var streamReader = new StreamReader(fileStream, Encoding.UTF8))
            {
                string line;
                // Note: ReadLine strips line terminators, so they are not kept in the result.
                while ((line = streamReader.ReadLine()) != null)
                {
                    text.Append(line);
                }
            }
            return text.ToString();
        }
    }
The result I actually get is all of the text up to the point where the special characters begin:
{\rtf1\ansi\ansicpg1252\deff0\deftab720{\fonttbl{\f0\fnil Times New Roman;}{\f1\fnil Arial;}}{\colortbl;\red000\green000\blue000;\red255\green000\blue000 ;\red128\green128\blue128;}\paperw11905\paperh16837\margl360\margr360\margt360\margb360 \sectd \sectdefaultcl \marglsxn360\margrsxn360\margtsxn360\margbsxn360{ *\do\dobxpage\dobypage\do dhgt819 2\dptxbx{\dptxbxtext\ pard\plain {\pict\wmetafile8\picw19499\pich1746\picwgoal1305695\pichgoal116957\bin342908
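A possible explanation, offered as an assumption rather than a confirmed diagnosis: in RTF, `\binN` is followed by N bytes of raw binary data, which can include null bytes and byte sequences that are not valid UTF-8 or ASCII text. The sketch below reads the file as bytes and maps each byte to exactly one char through ISO-8859-1, so nothing is dropped or replaced during decoding. The names `RtfBytes` and `ReadRaw` are hypothetical and do not come from the original code.

```csharp
using System.IO;
using System.Text;

// Hypothetical helper for illustration; not part of the original post.
public static class RtfBytes
{
    public static string ReadRaw(string path)
    {
        // Read the whole file without any text decoding.
        byte[] bytes = File.ReadAllBytes(path);

        // ISO-8859-1 maps bytes 0x00-0xFF to chars U+0000-U+00FF one to one,
        // so the binary payload after \bin survives unchanged.
        Encoding latin1 = Encoding.GetEncoding("ISO-8859-1");
        return latin1.GetString(bytes);
    }
}
```

With this approach the string length equals the file's byte count, so a parser can skip exactly N chars after `\binN`. It may also be worth checking `text.Length` directly, since some debuggers and UI controls stop displaying a string at the first null character even when the rest of the data is present.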