Java: Reading and Writing TXT Files and Detecting the Text Encoding

尘T陌

Simple text reading:

String readPath = "dataset/textData/De_";
// ij is a file index defined elsewhere in the original program.
// Note: FileReader decodes with the platform default encoding.
FileReader f2 = new FileReader(readPath + (ij + 1) + "_.txt");
BufferedReader readerText = new BufferedReader(f2);
for (String line = readerText.readLine(); line != null; line = readerText.readLine()) {
    // Skip lines that consist only of punctuation, digits, and ASCII letters
    if (line.replaceAll("[,。、|~!@#¥; 【》‘“”、】《·\\-??:,:{}%……&*()\\=—+ A-Za-z0-9]","").trim().equals("")) {
        continue;
    }
    // Skip blank lines
    if (line.trim().equals("")) {
        continue;
    }
    System.out.println(line);
}
readerText.close();
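
The loop above can also be wrapped in a small helper that closes the reader automatically. The following is only a minimal sketch: the class name `SimpleRead` and the method `readMeaningfulLines` are hypothetical, the file is assumed to be UTF-8, and the filter is deliberately simplified to ASCII letters, digits, and whitespace instead of the full punctuation list used above.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

class SimpleRead {
    // Hypothetical helper: returns every line that still has content after
    // ASCII letters, digits, and whitespace are stripped out.
    static List<String> readMeaningfulLines(Path file) throws IOException {
        List<String> kept = new ArrayList<>();
        // try-with-resources closes the reader even if an exception is thrown
        try (BufferedReader reader = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
            for (String line = reader.readLine(); line != null; line = reader.readLine()) {
                // Simplified filter (an assumption, not the original regex)
                if (line.replaceAll("[A-Za-z0-9\\s]", "").isEmpty()) {
                    continue; // blank line, or only letters/digits/whitespace
                }
                kept.add(line);
            }
        }
        return kept;
    }
}
```

Returning a list instead of printing makes the filter easy to check on a small sample file before running it over a whole data set.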

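Simple writing:

String writePath = "dataset/TrainInput/train_set"; // where the training data is saved
// The second argument (true) opens the file in append mode
FileWriter writerText = new FileWriter(writePath + "_.txt", true);
writerText.write(text); // text is the String to be written
writerText.write("\r\n");
writerText.close();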
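
As a sketch of the same append-mode write with automatic cleanup, the snippet below uses try-with-resources. The class `SimpleAppend` and the method `appendLine` are hypothetical names introduced for illustration, and `newLine()` writes the platform line separator rather than a hard-coded "\r\n".

```java
import java.io.BufferedWriter;
import java.io.FileWriter;
import java.io.IOException;

class SimpleAppend {
    // Hypothetical helper: appends one line of text to the file at the given path.
    static void appendLine(String path, String text) throws IOException {
        // The second FileWriter argument (true) means "append, do not overwrite"
        try (BufferedWriter writer = new BufferedWriter(new FileWriter(path, true))) {
            writer.write(text);
            writer.newLine(); // platform-specific line separator
        }
    }
}
```

Calling `appendLine` twice on the same path leaves both lines in the file, because each call reopens the file in append mode.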

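Reading with a specified encoding:

        // The file to read
        File r1 = new File("filename");
        // Open a byte stream on the file
        FileInputStream r2 = new FileInputStream(r1);
        // Buffer the byte stream
        BufferedInputStream r3 = new BufferedInputStream(r2);
        // Decode the bytes using the given encoding
        InputStreamReader r4 = new InputStreamReader(r3,"utf-8");
        // Read line by line through a 1 MB character buffer
        BufferedReader r5 = new BufferedReader(r4,1024*1024);
        // Test r6 itself: calling readLine() again in the condition would skip lines
        for (String r6 = r5.readLine(); r6 != null; r6 = r5.readLine()) {
            System.out.println(r6);
        }
        r5.close();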
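
A compact variant of the same stream chain, offered as a sketch rather than the author's method: the class `EncodedRead` and the method `readAllLines` are hypothetical, and the charset is passed as a `Charset` object so that a misspelled encoding name fails at the call site instead of inside the reader.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

class EncodedRead {
    // Hypothetical helper: decodes the file with the given charset and
    // returns all of its lines in order.
    static List<String> readAllLines(Path file, Charset charset) throws IOException {
        List<String> lines = new ArrayList<>();
        try (BufferedReader reader = new BufferedReader(
                new InputStreamReader(Files.newInputStream(file), charset))) {
            String line;
            // readLine() is called exactly once per iteration, so no line is skipped
            while ((line = reader.readLine()) != null) {
                lines.add(line);
            }
        }
        return lines;
    }
}
```

For example, `EncodedRead.readAllLines(path, StandardCharsets.UTF_8)` reads a UTF-8 file, and `Charset.forName("GBK")` can be passed for GBK files on JDKs that ship that charset.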

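Writing with a specified encoding:

        // The file to write
        File w1 = new File("filename");
        // Open a byte stream on the file (overwrites existing content)
        FileOutputStream w2 = new FileOutputStream(w1);
        // Encode characters as GBK before writing them to the stream
        OutputStreamWriter w3 = new OutputStreamWriter(w2,"GBK");
        w3.write(""); // the text to write goes here
        w3.close();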
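
To illustrate why the encoding passed to `OutputStreamWriter` matters, here is a minimal sketch that writes the same text with a caller-chosen charset. The class `EncodedWrite` and the method `writeText` are hypothetical names, and the example assumes the JDK in use provides the GBK charset.

```java
import java.io.IOException;
import java.io.OutputStreamWriter;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.file.Files;
import java.nio.file.Path;

class EncodedWrite {
    // Hypothetical helper: writes text to the file using the given charset,
    // replacing any existing content.
    static void writeText(Path file, String text, Charset charset) throws IOException {
        try (Writer writer = new OutputStreamWriter(Files.newOutputStream(file), charset)) {
            writer.write(text);
        }
    }
}
```

The same character produces different bytes under different encodings: the single character "中" occupies 2 bytes in a GBK file and 3 bytes in a UTF-8 file, which is why a file must be read back with the encoding it was written in.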

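Detecting the encoding of a TXT file:

private static String getCharsetCode(String fileName) throws IOException {
        // Read the first two bytes and compare them with known byte order marks (BOMs)
        int p;
        try (BufferedInputStream bin = new BufferedInputStream(new FileInputStream(fileName))) {
            p = (bin.read() << 8) + bin.read();
        }

        String code;

        switch (p) {
            case 0xefbb: // first two bytes of the UTF-8 BOM (EF BB BF)
                code = "UTF-8";
                break;
            case 0xfffe: // UTF-16 little-endian BOM (FF FE)
                code = "Unicode";
                break;
            case 0xfeff: // UTF-16 big-endian BOM (FE FF)
                code = "UTF-16BE";
                break;
            default: // no BOM found: GBK is only a fallback guess (UTF-8 without a BOM also lands here)
                code = "GBK";
        }
        return code;
    }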
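
The BOM check can also be expressed on a byte array, which makes it easy to try without touching the file system. This is a sketch under stated assumptions, not a general encoding detector: the class `BomSniffer` and the method `detectBom` are hypothetical, the method checks all three bytes of the UTF-8 BOM, and it returns `null` when no BOM is present so the caller chooses the fallback (GBK in the method above).

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class BomSniffer {
    // Hypothetical helper: inspects the first bytes of a file and returns the
    // charset indicated by its byte order mark, or null if there is none.
    static Charset detectBom(byte[] head) {
        if (head.length >= 3
                && (head[0] & 0xFF) == 0xEF
                && (head[1] & 0xFF) == 0xBB
                && (head[2] & 0xFF) == 0xBF) {
            return StandardCharsets.UTF_8;    // EF BB BF
        }
        if (head.length >= 2 && (head[0] & 0xFF) == 0xFF && (head[1] & 0xFF) == 0xFE) {
            return StandardCharsets.UTF_16LE; // FF FE
        }
        if (head.length >= 2 && (head[0] & 0xFF) == 0xFE && (head[1] & 0xFF) == 0xFF) {
            return StandardCharsets.UTF_16BE; // FE FF
        }
        return null; // no BOM: the encoding cannot be told from these bytes alone
    }
}
```

Many text files, including most UTF-8 files produced on Linux and macOS, carry no BOM at all, so a `null` result here says nothing about the actual encoding.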