如何操作很长的字符串以避免 golang 内存不足

正如您所发现的，如果您实际生成字符串，您最终将在 RAM 中拥有巨大的内存块。表示“传入字节的大序列”的一种常见方法是将其实现为io.Reader（您可以将其视为字节流），并让您的代码运行一个r.Read(buff)循环。鉴于您提到的练习的具体情况（固定字符串重复n次数），特定字母的出现次数也可以直接根据该字母在中出现的次数s以及更多内容（我会让您弄清楚应该做哪些乘法和计数）。如何实现一个重复字符串而不分配 10^12 倍字符串的阅读器？请注意，在实现该.Read()方法时，调用者已经分配了他的缓冲区。您不需要在内存中重复您的字符串，您只需要用正确的值填充缓冲区——例如，将数据逐字节复制到缓冲区中。这是一种方法：type RepeatReader struct {    str   string    count int}func (r *RepeatReader) Read(p []byte) (int, error) {    if r.count == 0 {        return 0, io.EOF    }    // at each iteration, pos will hold the number of bytes copied so far    var pos = 0    for r.count > 0 && pos < len(p) {        // to copy slices over, you can use the built-in 'copy' method        // at each iteration, you need to write bytes *after* the ones you have already copied,        // hence the "p[pos:]"        n := copy(p[pos:], r.str)        // update the amount of copied bytes        pos += n        // bad computation for this first example :        // I decrement one complete count, even if str was only partially copied        r.count--    }    return pos, nil}https://go.dev/play/p/QyFQ-3NzUDV要获得完整、正确的实施，您还需要跟踪下次.Read()调用时需要开始的偏移量：type RepeatReader struct {    str    string    count  int    offset int}func (r *RepeatReader) Read(p []byte) (int, error) {    if r.count == 0 {        return 0, io.EOF    }    var pos = 0    for r.count > 0 && pos < len(p) {        // when copying over to p, you should start at r.offset :        n := copy(p[pos:], r.str[r.offset:])        pos += n        // update r.offset :        r.offset += n        // if one full copy of str has been issued, decrement 'count' and reset 'offset' to 0        if r.offset == len(r.str) {            r.count--            r.offset = 0        }    }    return pos, nil}https://go.dev/play/p/YapRuioQcOz您现在可以a在遍历此 Reader 时计算 s。

如何操作很长的字符串以避免 golang 内存不足

2回答