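OutOfMemoryError when reproducing the BioGrakn text mining example with Client Java

I am trying to reproduce the BioGrakn example from the white paper "Text Mined Knowledge Graphs", with the aim of later building a text-mined knowledge graph from my own (non-biomedical) document collection. To do that, I built a Maven project from the classes and data of the text mining use case in the biograkn repository. My pom.xml looks like this: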

<project>
  <groupId>TextMining-BioGrakn</groupId>
  <artifactId>TextMining-BioGrakn</artifactId>
  <version>0.0.1-SNAPSHOT</version>
  <name>TextMining-BioGrakn</name>
  <repositories>
    <repository>
      <id>repo.grakn.ai</id>
      <url>https://repo.grakn.ai/repository/maven/</url>
    </repository>
  </repositories>
  <dependencies>
    <dependency>
      <groupId>io.grakn.client</groupId>
    </dependency>
    <dependency>
      <groupId>io.grakn.core</groupId>
      <artifactId>concept</artifactId>
    </dependency>
    <dependency>
      <groupId>io.graql</groupId>
    </dependency>
    <dependency>
      <groupId>edu.stanford.nlp</groupId>
      <artifactId>stanford-corenlp</artifactId>
    </dependency>
    <dependency>
      <groupId>edu.stanford.nlp</groupId>
      <artifactId>stanford-corenlp</artifactId>
      <classifier>models</classifier>
    </dependency>
  </dependencies>
</project>

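Migrating the schema, inserting the published articles, and training the model all work fine, but then I get a java.lang.OutOfMemoryError: GC overhead limit exceeded, which is thrown in the mineText() method of the CoreNLP class. The main method in the Migrator class looks like this: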

public class Migrator {
    public static void main(String[] args) {
        GraknClient graknClient = new GraknClient("localhost:48555");
        GraknClient.Session session = graknClient.session("text_mining");
        // ... remainder of main() not shown
    }
}
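For context, HotSpot raises "GC overhead limit exceeded" when the JVM spends almost all of its time in garbage collection while reclaiming very little heap, so a useful first data point is how much heap the process is actually allowed to use. The following is only a minimal diagnostic sketch, not part of the BioGrakn code: the class name `HeapCheck` and its helper methods are hypothetical, and it uses nothing beyond the standard `java.lang.Runtime` API.

```java
import java.util.Locale;

// Hypothetical diagnostic helper; not part of the BioGrakn repository.
final class HeapCheck {

    // Convert a byte count to whole mebibytes (rounds down).
    static long toMiB(long bytes) {
        return bytes / (1024L * 1024L);
    }

    // Summarise heap figures: the upper limit, what is currently
    // committed, and how much of the committed heap is in use.
    static String describe(long maxBytes, long totalBytes, long freeBytes) {
        long usedBytes = totalBytes - freeBytes;
        return String.format(Locale.ROOT,
                "max=%d MiB, committed=%d MiB, used=%d MiB",
                toMiB(maxBytes), toMiB(totalBytes), toMiB(usedBytes));
    }

    public static void main(String[] args) {
        // Runtime reports the limits of the JVM this code is running in.
        Runtime rt = Runtime.getRuntime();
        System.out.println(
                describe(rt.maxMemory(), rt.totalMemory(), rt.freeMemory()));
    }
}
```

Calling `HeapCheck.describe(...)` with the same `Runtime` values at the start of `Migrator.main` would show the limit that the text mining step runs under. If that maximum turns out to be small, starting the JVM with a larger `-Xmx` value (for example `-Xmx4g`) is one thing to try, although whether that is enough for this workload is an assumption that would need testing.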

Do you know what might be causing this error? Am I missing something fundamental here? Any help is much appreciated.

慕莱坞森
