kermitt2/grobid
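GROBID is machine learning software for extracting information from scholarly documents. It converts raw documents such as PDF into structured XML/TEI-encoded documents and is particularly suited to technical and scientific publications.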
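To make "structured XML/TEI-encoded documents" concrete, here is a minimal sketch of reading a title out of a TEI document using only the JDK's XML parser. The class name `TeiTitleExample`, the method `extractTitle`, and the sample document are hypothetical and hand-written for illustration. The sample is not real GROBID output, and actual output may be structured differently.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class TeiTitleExample {
    // Namespace used by TEI P5 documents.
    static final String TEI_NS = "http://www.tei-c.org/ns/1.0";

    // Hand-written sample for illustration only (assumption: not real GROBID output).
    static final String SAMPLE_TEI =
        "<TEI xmlns=\"http://www.tei-c.org/ns/1.0\">"
      + "<teiHeader><fileDesc><titleStmt>"
      + "<title>A Hypothetical Paper Title</title>"
      + "</titleStmt></fileDesc></teiHeader>"
      + "</TEI>";

    // Returns the text of the first TEI <title> element, or null if there is none.
    static String extractTitle(String teiXml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // required for namespace-qualified lookups
        Document doc = factory.newDocumentBuilder()
            .parse(new InputSource(new StringReader(teiXml)));
        NodeList titles = doc.getElementsByTagNameNS(TEI_NS, "title");
        return titles.getLength() == 0 ? null : titles.item(0).getTextContent().trim();
    }

    public static void main(String[] args) throws Exception {
        System.out.println(extractTitle(SAMPLE_TEI));
    }
}
```

The lookup is namespace-aware because TEI elements live in the TEI namespace. A plain `getElementsByTagName` call can behave differently once a document uses prefixes.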
Tech stack
Root directory: java
Dependencies (44)
black.ninia:jep:4.3.1
ch.qos.logback:logback-classic:1.2.13
com.cybozu.labs:langdetect:1.1-20120112
com.fasterxml.jackson.core:jackson-core:2.21.1
com.fasterxml.jackson.core:jackson-databind:2.21.1
com.fasterxml.jackson.dataformat:jackson-dataformat-yaml:2.21.1
com.fasterxml.jackson.module:jackson-module-afterburner:2.21.1
com.github.pemistahl:lingua:1.2.2
com.google.guava:guava:33.5.0-jre
com.rockymadden.stringmetric:stringmetric-core_2.10:0.27.3
com.rockymadden.stringmetric:stringmetric-core_2.11:0.27.4
commons-dbutils:commons-dbutils:1.8.1
commons-io:commons-io:2.21.0
commons-pool:commons-pool:1.6
io.dropwizard.metrics:metrics-core:4.2.38
io.dropwizard.metrics:metrics-servlets:4.2.38
io.dropwizard.modules:dropwizard-testing-junit4:4.0.16
io.dropwizard:dropwizard-assets:4.0.17
io.dropwizard:dropwizard-auth:4.0.17
io.dropwizard:dropwizard-bom:4.0.17
io.dropwizard:dropwizard-client:4.0.17
io.dropwizard:dropwizard-core:4.0.17
io.dropwizard:dropwizard-forms:4.0.17
io.dropwizard:dropwizard-json-logging:4.0.17
io.dropwizard:dropwizard-testing:4.0.17
io.prometheus:simpleclient_dropwizard:0.16.0
io.prometheus:simpleclient_servlet:0.16.0
javax.activation:activation:1.1.1
javax.xml.bind:jaxb-api:2.3.0
joda-time:joda-time:2.14.0
me.tongfei:progressbar:0.9.5
net.arnx:jsonic:1.3.10
net.sf.saxon:Saxon-HE:9.6.0-9
org.apache.commons:commons-collections4:4.5.0
org.apache.commons:commons-lang3:3.20.0
org.apache.commons:commons-text:1.15.0
org.apache.httpcomponents:httpclient:4.5.14
org.apache.lucene:lucene-analyzers-common:4.5.1
org.apache.opennlp:opennlp-tools:1.9.4
org.apache.pdfbox:pdfbox:2.0.35
org.slf4j:slf4j-api:1.7.36
ru.vyarus:dropwizard-guicey:7.3.1
xerces:xercesImpl:2.12.2
xom:xom:1.3.9