知识图谱构建技术综述
如果无法正常显示,请先停止浏览器的去广告插件。
1. DOI :
10.
7544
i
s
sn1000-1239.
2016.
20148228
?
53 (
3 ): 582-600 , 2016
J
ou
r
na
l o
f Compu
t
e
r Re
s
e
a
r
ch and De
ve
l
opmen
t
(
610054 )
(
l
i
u @ue
s
t
c.
edu.
cn )
q
Knowl
e
dg
e Gr
aph Con
s
t
r
u
c
t
i
on Te
chn
i
e
s
qu
L
i
u Qi
ao , L
i Yang , Duan Hong , L
i
u Yao , and Qi
n Zh
i
guang
(
Scho
o
l of Info
rma
t
i
on and Softwar
e Engi
ne
e
r
i
ng , Un
i
v
e
r
s
i
t
e
c
t
r
on
i
c Sc
i
enc
e and Te
chno
l
ogy of Ch
i
na , Chengdu
y of El
610054 )
Ab
s
t
r
a
c
t Goog
l
e
s knowl
edge g
e
chno
l
ogy ha
s d
r
awn a l
o
t o
f r
e
s
e
a
r
ch a
t
t
en
t
i
ons i
n r
e
c
en
t ye
r
aph t
a
r
s.
Howeve
r , due t
o t
he l
imi
t
ed pub
i
s
c
l
osur
e o
f t
e
chn
i
c
a
l de
t
a
i
l
s , peop
i
nd i
t d
i
f
f
i
cu
l
t t
o unde
r
s
t
and
l
i
c d
l
e f
t
a
t
i
on and va
l
ue o
f t
h
i
s t
e
chno
l
ogy.I
n t
h
i
s pape
r , we i
n
t
r
oduc
e t
he key t
e
chn
i
s i
nvo
l
ved
t
he conno
que
,
i
n t
he cons
t
ruc
t
i
on o
f knowl
edge g
r
aph i
n a bo
t
t
om-up way s
t
a
r
t
i
ng f
r
om a c
l
e
a
r
l
f
i
ned conc
ep
t
y de
r
aph.F
i
r
s
t
l
and a t
e
chn
i
c
a
l a
r
ch
i
t
e
c
t
ur
e o
f t
he knowl
edge g
s
c
r
i
be i
n de
t
a
i
l t
he de
f
i
n
i
t
i
on and
y , we de
f t
he knowl
edge g
hen we pr
he t
e
chn
i
c
a
l f
r
amewo
rk f
o
r knowl
edge
conno
t
a
t
i
on o
r
aph , and t
opos
e t
t
ruc
t
i
on , i
n wh
i
ch t
he cons
t
ruc
t
i
on pr
s d
i
v
i
ded i
n
t
o t
hr
e
e l
eve
l
s a
c
co
rd
i
ng t
o t
he
r
aph cons
oc
e
s
s i
g
abs
t
r
a
c
t l
eve
l o
f t
he i
npu
t knowl
edge ma
t
e
r
i
a
l
s , i
nc
l
ud
i
ng t
he i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on l
aye
r , t
he
oc
e
s
s
i
ng l
knowl
edge i
n
t
eg
r
a
t
i
on l
aye
r , and t
he knowl
edge pr
aye
r , r
e
spe
c
t
i
ve
l
cond
l
he
y.Se
y , t
r
e
s
e
a
r
ch s
t
a
t
us o
f t
he key t
e
chno
l
og
i
e
s f
o
r e
a
ch l
eve
l a
r
e sur
veyed compr
ehens
i
ve
l
l
so
y and a
e
s o
r
adua
l
l
r
aph
i
nve
s
t
i
t
ed c
r
i
t
i
c
a
l
l
o
r t
he purpos
f g
eve
a
l
i
ng t
he mys
t
e
r
i
e
s o
f t
he knowl
edge g
y r
ga
y f
,
t
e
chno
l
ogy , t
he s
t
a
t
e-o
f-t
he-a
r
t pr
og
r
e
s
s , and i
t
s r
e
l
a
t
i
onsh
i
w
i
t
h
r
e
l
a
t
e
d
d
i
s
c
i
l
i
n
e
s
.F
i
n
a
l
l
f
ve
p
p
y i
ma
o
r r
e
s
e
a
r
ch cha
l
l
enge
s i
n t
h
i
s a
r
e
a a
r
e summa
r
i
z
ed , and t
he co
r
r
e
spond
i
ng key r
e
s
e
a
r
ch i
s
sue
s a
r
e
j
h
i
l
i
t
ed.
gh
gh
Ke
r
d
s knowl
edge g
r
aph ; s
eman
t
i
c Web ; i
n
f
o
rma
t
i
on r
e
t
r
i
eva
l ; s
eman
t
i
c s
e
a
r
ch eng
i
ne ; na
t
ur
a
l
y wo
l
anguage pr
oc
e
s
s
i
ng
,
,
,
.
.
1 )
,
3
:
,
、
;
2 )
,
,
;
3 )
.
;
;
;
;
TP18
:
2014 - 11 - 06 ;
:
“
:
2015 - 04 - 08
”
(
2011AA010706 );
(
61133016 ,
61272527 );
-
( MCM20121041 )
Th
i
s wo
rk wa
s suppo
r
t
ed by t
he Na
t
i
ona
l Hi
chno
l
ogy Re
s
e
a
r
ch and Deve
l
opmen
t Pr
og
r
am o
f Ch
i
na (
863 Pr
og
r
am )
gh Te
(
2011AA010706 ), t
he Na
t
i
ona
l Na
t
ur
a
l Sc
i
enc
e Founda
t
i
on o
f Ch
i
na (
61133016 , 61272527 ), and Mi
n
i
s
t
r
f Educ
a
t
i
on-
y o
Ch
i
naMob
l
i
e Commun
i
c
a
t
i
ons Co
r
r
a
t
i
on Re
s
e
a
r
ch Funds ( MCM20121041 ) .
po
2. :
5 8 3
,
, Web
,
( Web 1.
0 )
.
( HowNe
t )
、
、
,
(
l
i
nked da
t
a ), Web
.
[
1 ]
Web
,
Be
rne
r
s-Le
e
(
s
eman
t
i
c Web )
.
,
W3C
,
.
、
( We
b o
f d
a
t
a ),
,
,
.
XLo
r
e ③ 、
1
①
(
knowl
e
dg
e
.
r
aph )
g
(
OpenKN )
、
“
、
( Knowwa
r
e )
.
,
、
、
,
.
zh
i
sh
i.me ④ 、
[
2 ]
,
⑤
GDM
,
”
,
,
,
.
,
.
,
,
.
.
.
2012
5
17
,
,
.
Me
t
aweb
,
,
,
2010
,
.
,
,
,
,
.
,
.
.
,
1
2006 ,
Be
rne
r
s-Le
e
(
,
l
i
nked da
t
a )
URI (
un
i
f
o
rm r
e
sour
c
e i
den
t
i
f
i
e
r ),
.
RDF (
r
e
sour
c
e de
s
c
r
i
t
i
on f
r
amewo
rk ),
OWL ( Web
p
on
t
o
l
ogy l
anguage ),
②
, :
.
,
.
,
.
,
Sa
t
o
r
i
,
t
t
w3.
o
r
s
i
I
s
sue
s
i
nkedDa
t
a.
h
tml
② h
?
? www.
? De
? L
p :
g
gn
t
t
x
l
o
r
e.
o
r
i
ndex.
a
c
t
i
on
③ h
?
?
?
p :
g
t
t
zh
i
sh
i.
apex
l
ab.
o
r
④ h
?
?
p :
g
t
t
f
udan.
edu.
cn
⑤ h
?
?
p :
gdm.
t
t
en.
wi
k
i
i
a.
o
r
k
i
edge Gr
aph
⑥ h
?
?
? wi
? Knowl
p :
ped
g
Sa
t
o
r
i
,
,
t
t
w3.
o
r
s
t
anda
r
ds
s
eman
t
i
cweb
da
t
a
① h
?
? www.
?
?
?
p :
g
,
We
i
t
z
.
.
2013 7
(
B
i
ng )
,
⑥
.
,
,
3. 2016 , 53 (
3 )
5 8 4
,
,
1.
1
1.
. 1
,
,
,
Wo
l
f
r
amAl
pha
,
.
.
5
“
.
10
-
350
,
.
Pr
ob
a
s
e
,
,
Web
,
,
.
Wo
l
f
r
amAl
pha
,
:
.
,
,
1
,
.
:
3
1 )
,
.
,
(
Tab
l
e 1 Knowl
e
dg
e Gr
aph and S
imi
l
a
r Pr
odu
c
t
s
),
1
,
.
.
Knowl
edge Ba
s
e Pr
oduc
t
s Da
t
a Sour
c
e
Knowl
edge
Vau
l
t a
ch Eng
i
ne
Goog
l
e Se
Goog
l
e Now e
eba
s
e ,
Wi
k
i
i
a , Fr
ped
Web Open Da
t
a
Wo
l
f
r
am Al
pha App
l
e S
i
r
i Ma
t
hema
t
i
c
a
B
i
ng Se
a
ch Eng
i
ne
r
t
ana
Mi
c
r
o
s
o
f
t Co Wi
k
i
i
a ,
ped
Web Open Da
t
a
Wa
t
s
on KB t
s
on
IBM Wa
Sys
t
em c
t
i
ona
r
i
e
s
Web Di
The Wo
r
l
d Book
Enc
l
oped
i
a
yc
DBped
i
a KB DBped
i
a Wi
k
i
i
a
ped
YAGO KB YAGO Wi
k
i
i
a
ped
NELL KB NELL Web Open Da
t
a
Fa
c
ebook KB Shopyc
a
t So
c
i
a
l Ne
two
rk
Da
t
a
Zh
i
l
i
f
ang KB Sougou Se
a
ch
Eng
i
ne Web Open Da
t
a
Zh
i
x
i
n KB Ba
i
du Zh
i
x
i
n
P
l
a
t
f
o
rm Us
e
r Gene
r
a
t
ed
Con
t
en
t
XLORE Ch
i
ne
s
e
l
i
sh
? Eng
Enc
l
oped
i
a ,
yc
Wi
k
i
i
a
ped
Zh
i
sh
i.
me KB
Zh
i
sh
i.
me
2 )
,
(
ove
r
l
ay ne
two
rk ),
Web
,
Web
,
,
.
3 )
,
,
(
);
,
.
1.
2
,
(
)
,
.
,
:
2
,
.
Ch
i
ne
s
e Enc
l
oped
i
a
yc
,
1
,
.
[
3 ]
i
ngua
l KB
Cr
o
s
s-L
,
,
.
App
l
e S
i
r
i ,
Goog
l
e Now
Sa
t
o
r
i
oba
s
e
? Pr
”
-
.
,
,
-
,
,
-
-
.
(
f
a
c
t )
Gr
aphd
“
.
”
-
Tr
i
n
i
t
y
”
-
“
,
Fa
c
ebook ,
App
l
e ,
IBM
,
.
,
“
” .
,
,
,
,
.
,
.
,
、
4. :
5 8 5
、
.
,
1
,
.
,
,
(
.
),
,
,
.
,
. 1
,
3
,
:
、
.
F
i
chn
i
c
a
l a
r
ch
i
t
e
c
t
u
r
e o
f knowl
edge g
r
aph.
g.1 Te
1
2
2
.
,
,
;
,
,
1.
2
,
,
,
,
,
:
1 )
3
(
.
,
)、
,
;
2 )
,
,
,
,
;
3 )
,
.
(
),
Knowl
edge Vau
l
t
Sa
t
o
r
i
,
,
,
,
,
,
.
,
、
、
.
、
.
2.
1
(
i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on )
,
,
、
,
,
,
Fr
e
eba
s
e
,
3
.
:
1
,
.
5. 2016 , 53 (
3 )
5 8 6
,
[
4 ]
、
.
:
S
t
an
f
o
rd
NER
、
.
,
.
, Web 2.
0
,
2.
1.
1
,
(
named en
t
i
t
y
r
e
cogn
i
t
i
on , NER ),
.
(
.
)
,
,
,
,
(
.
)
[
11 ]
,
(
.
),
,
、
(
[
5 ]
)
[
6 ]
,
Rau
.
1991
.
,
,
,
.
,
[
12 ]
, Wh
i
t
e
l
aw
.
,
,
,
,
.
,
,
,
[
7 ]
,
L
i
u
(
K -Ne
a
r
e
s
t
K -
Ne
i
r
s )
ghbo
,
.
,
Twi
t
t
e
r
.
.
,
,
,
,
,
,
,
.
(
,
)
,
[
8 ]
,
,
L
i
n
,
Med
l
i
ne
.
2.
1.
2
,
GENIA
70%
,
.
,
,
,
(
open doma
i
n )
,
)
(
,
.
,
,
[
13 ]
J
a
i
n
,
,
.
.
,
;
.
,
2002
[
9 ]
,
Sek
i
ne
.
:
1 )
2
,
150
,
;
2 )
.
,
,
.
2012
L
i
ng
[
10 ]
Fr
e
eba
s
e
,
,
.
,
,
,
112
,
,
.
、
Kambha
t
l
a
[
14 ]
,
6. :
5 8 7
[
21 ]
Fade
r
Tex
tRunne
r
WOE
,
.
,
,
,
.
[
15 ]
,
,
( HowNe
t )
,
,
,
ACE
,
6
.Maus
am
,
88%.
,
[
22 ]
,
,
,
.
,
OILL
IE
,
.
[
16 ]
,
Ca
r
l
son
.
,
Boo
t
s
t
r
ap
[
17 ]
.
,
,
Boo
t
s
t
r
app
i
ng
,
.
,
( H-
,
N -Gr
am
[
23 ]
Banko
CRF ),
,
,
,
.
Zhang
[
18 ]
,
,
,
、
,
.
.
S
t
a
tSnowba
l
l
[
24 ]
.
,
、
OIE
,
.
2
.
1 )
(
.
,
2007
Banko
(
op
e
n i
n
f
o
rma
t
i
on e
x
t
r
a
c
t
i
on ,
;
2 )
,
(
s
e
l
f-supe
r
v
i
s
ed )
,
.
[
25 ]
(
Tex
tRunne
r ) .
,
,
N
KRAKEN
,
,
OIE
,
.
[
26 ]
,
-
Al
an
OIE
,
“
,
,
[
19 ]
OIE ),
)
,
McCa
l
l
um
”
-
,
OIE
,
.
,
.
OIE
,
,
,
,
[
20 ]
(
i
n
f
obox )
,
,
, Wu
2.
1.
3
,
OIE
WOE
.
,
,
.
,
Tex
tRunne
r
、
,
、
、
.
.
,
.
7. 2016 , 53 (
3 )
5 8 8
2.
2.
1
,
(
en
t
i
t
i
nk
i
ng )
y l
,
[
27 ]
.
,
[
32 ]
.
.
,
.
,
Suchanek
.
[
28 ]
,
Wi
k
i
i
a
ped
Wo
rdNe
t
,
,
,
( YAGO ),
95%. YAGO
,
, DBped
i
a
Fr
e
eba
s
e
(
co
l
l
e
c
t
i
ve en
t
i
t
i
nk
i
ng ) .
y l
,
[
33 ]
458
.
,
30
.
L
i
nked Da
t
a
,
DBped
i
a
:
1 )
,
,
Han
,
;
2 )
、
,
,
DBped
i
a
[
29 ]
.
,
;
3 )
,
,
.
1 )
.
(
en
t
i
t
i
s
amb
i
t
i
on )
y d
gua
.
.
,
,
,
,
,
“
”
(
)
,
[
30 ]
;
,
,
,
,
.
.
.
,
(
,
,
.
),
,
,
(
①
[
31 ]
.
4
.
) .
,
2.
2
,
,
、
.
,
,
[
34 ]
,
,
,
.
:
2
,
.
,
.
Bagga
, MUC6 ( Me
s
s
a
e Und
e
r
s
t
a
nd
i
ng
g
Con
f
e
r
enc
e )
(
F
84.
6% ) .
,
,
,
②
.
.
,
8. :
5 8 9
,
,
,
. Ra
t
i
nov
, [
42 ]
.
[
41 ]
[
35 ]
Pede
r
s
en
.Ochs
,
,
DBped
i
a
,
,
.
,
.
③
、
.
,
,
2 )
.
(
en
t
i
t
e
so
l
u
t
i
on )
y r
,
.
,
,“
Ba
r
a
c
k Ob
ama ”,“
ama ”,
r
e
s
i
d
e
n
t Ob
p
(
“
t
he pr
e
s
i
den
t ”
),
.
,
“
he ”,“
h
im ” ,
[
36 ]
Ma
l
i
n
,
(
,
.
)
.
.
,
.
④
(
,
)
,
,
,
.
Han
[
37 ]
(
en
t
i
t
t
ch
i
ng )
y ma
(
en
t
i
t
y synonyms ) .
,
[
38 ]
.
,
,
,
,
.
L
i
nden
,
Hobb
s
(
c
en
t
e
r
i
ng t
heo
r
y ) .Hobbs
,
[ ]
,
Sen 39
,
86%
[
40 ]
( ob
e
c
t
j
a
l
i
t )、
gnmen
,
.
Bune
s
cu
:
,
,
.
Shen
.
(
Hobbs
Hobbs
),
[
43 ]
.
Wo
rdne
t
,
.
,
:
(
u
t
t
e
r
anc
e )
(
d
i
s
cour
s
e )
,
,
,
(
.
),
,
[
44 ]
.
[
32 ]
,
L
i
,
, Twi
t
t
e
r
,
.
,
,
.
.
,
,
,
,
.
.
Lapp
i
n
[
45 ]
,
,
.
,
3
9. 2016 , 53 (
3 )
5 9 0
,
Hobbs
2.
2.
2
,
.
,
.McCa
r
t
hy
(
l
i
nked open da
t
a )
,
,
C4.
5
DBped
i
a YAGO ,
, Mus
i
cBr
a
i
nz
MUC-5
[
47 ]
.
Be
an
,
,
Ut
ah
Au
t
oS
l
og
,
Demps
t
e
r-Sha
f
e
r
, 2
,
.
[
46 ]
2
、
,
、
,
,
4
)
76%
87%
; ②
,
.
,
[
52 ]
.
[
53 ]
, Mende
s
.
,
(
l
i
nked da
t
a i
n
t
eg
r
a
t
i
on
f
r
amewo
rk ,
LDIF ),
.
[
48 ]
.Turney
n
f
o
rma
t
i
on ,
PMI )
mu
t
ua
l i
,
TOEFL
,
.
1 )
. ①
( MUC-
DrugBank
(
i
n
twi
s
e
po
.
LOD
: ①
4
; ②
,
,
; ③
ESL
74%
[
49 ]
.
Cheng
,
,
,
; ④
,
,
.
,
,
URL
,
(
c
l
i
ck s
imi
l
a
r
i
t
y ),
,
,
.
2 )
.
,
:
2
(
)
,
.
Pan
t
e
l
[
50 ]
.
,
,
Ha
r
r
i
s
(
RDF )
.
,
RDB2RDF ,
(
t
e
rm s
imi
l
a
r
i
t
y ),
RDF
,
,
.
Chak
r
ab
a
r
t
i
[
51 ]
2
,
(
t
ex
t s
imi
l
a
r
i
t
r
y ),
que
y con
,
,
.
,
200
.
2
MapReduc
e
,
50h
4
B
i
ng
,
,
,
,
( Tr
RDB2RDF
i
l
i
f
p
y , D2R
Se
r
ve
r ,
OpenL
i
nk Vi
r
t
uoso ,
Spa
r
lMap ),
q
,
[
54 ]
, W3C 2012
.
2
: Di
(
r
e
c
t Mapp
i
ng A d
i
r
e
c
t mapp
i
ng o
f
r
e
l
a
t
i
ona
l da
t
a t
o RDF ) R2RML (
RDB t
o RDF
)
,
mapp
i
ng l
anguage .
Di
r
e
c
t Mapp
i
ng
,
5
B
i
ng
.
,
W3C
RDF
, RDF
.
10. :
5 9 1
,
R2RML
,
,
R2RML
,
RDF
,
“
I
sA ”
,
.
,
270
92.
8% ,
[
58 ]
,
.
( XML ,
CSV ,
JSON
)
,
3
:
RDF
、
[
59 ]
.
1 )
.
,
XSPARQL
RDF ,
Da
t
a
l
i
f
t
RDF ,
2
XML
XML
,
CSV
,
,
RDF
,
,
Pr
oba
s
e
.
2
,
.
[
55 ]
,
.
“
.
” “
”
,
2.
3
,
、
“
” “
” 2
,
,
,
.
;
.
2 )
,
,
.
,
、
(
I
sA )
,
.
(
:
3
、
,
,
,
,“
)
,
”
“
,
”
.
3 )
,
.
(
2.
3.
1
(
on
t
o
l
ogy )
,
1
) .
,
2
:
.
,
.
,
,
,
.
,
, :
,
.
(
d
i
s
t
r
i
bu
t
i
ona
l s
imi
l
a
r
i
t
y )
[
56 ]
,
.
“
I
sA ”
(
)
[
60 ]
,
.
,
,
.
,
1
,
N
,
,
1
,
[
57 ]
,
.
(
),
,
.
,
,
,
( He
a
r
s
t
,
.
.
I
sA
[
57 ]
,
KnowI
tAl
l ,
Tex
tRunne
r , NELL
,
,
,
,
.
I
sA
Pr
oba
s
e
,
, Pr
oba
s
e
.
,
)
,
.
,
.
:
,
,
11. 2016 , 53 (
3 )
5 9 2
[
61 ]
.
“
dome
s
t
i
c
Pr
ob
a
s
e
,
an
ima
l
s o
t
he
r t
han dogs such a
s c
a
t
s ”
,
,
,
,
.
,
I
sA
2 :(
c
a
t ,
I
sA ,
dog ) (
c
a
t ,
I
sA ,
dome
s
t
i
c
an
ima
l ) . Pr
oba
s
e
(
A ,
f
r
i
end ,
B ) f
r
i
end
,
.
[
58 ]
,
,
.
,
,
[
62 ]
Wang
B
,
,
,
.
A
,
.
,
,
,
(
de
s
c
r
i
t
i
on l
og
i
c )
p
,
.
.
,
.
,
,
TBox (
t
e
rmi
no
l
ogy box )
,
ABox (
a
s
s
e
r
t
i
on box ),
TBox
, ABox
,
.
[
63 ]
Wang
,
.
,
,
2
ABox
(
t
e
rm co-oc
cur
r
enc
e ne
two
rk )
(
CATHY ),
[
65 ]
,
.
,
[
64 ]
.
L
i
u
(
OWL )
Web
,
(
O (
n l
og n )),
1h
,
OWL
,
100
.
,
.
,
2.
3.
2
,
( s
eman
t
i
c Web ru
l
e l
anguage ,
SWRL )
,
,
.
,
[
66 ]
SWRL
.
,
,
,
. ( , , )
( , , )
( , , ),
( , , ) .
.
Pa
t
h Rank
i
ng
.
(
neur
a
l t
enso
r ne
two
rks )
、
Wo
rdNe
t
,
.
,
86.
2%
,
, ,
),
(
(
, ,
, ,
Pa
t
h Rank
i
ng
(
,
2
,
,
:
.
.
、
2
Pa
r
en
t
o
f
(
X ,
Y )
Pa
r
en
t
o
f
Ma
r
r
i
edTo
,
(
ed
i
c
a
t
i
on )
2
pr
.
1
X
→ Z ←
Y ,
.
(
i
nd
i
v
i
dua
l
s )
90.
0%.
),
,
) .
,
Fr
e
eBa
s
e
.
,
[
67 ]
Soche
r
,
,
) (
Lu
,
[
68 ]
.
Z ,
X
Y
12. :
5 9 3
,
,
、
[
71 ]
,
.
,
.
,
,
、
、
.
,
,
[
69 ]
,
.
.
,
,
,
.
.
,
,
[ ]
91% ,
,
.
80% 72 .
2.
4
[
70 ]
,
,
Tab
l
e
au
,
,
.
,
.
,
,
.
,
.
2.
3.
3
、
.
,
.
1 )
,
、
(
、
(
)
.
,
),
,
,
.
( Fr
)
,
e
eba
s
e
;
2 )
,
,
.
,
,
,
.
:
.
,
,
,
.
,
,
;
.
, Mende
s
[
53 ]
:
2
,
,
.
(
,
LDIF
(
S
i
eve
),
[
52 ]
),
.
,
.
3
REVERB
,
Fade
r
[
21 ]
,
1 000
,
,
,
,
REVERB
.
,
.
,
Knowl
edge Vau
l
t
,
,
,
,
Fr
e
eba
s
e
,
;
2 )
,
.
:
)
1
13. 2016 , 53 (
3 )
5 9 4
;
3 )
12.
47% ,
12.
65%
,
,
2
.
.
3.
2
,
.
,
、
,
.
.
,
(
.
:
1 )
3
),
,
2
;
2 )
;
3 )
,
,
.
.
,
2.
3.
1
,
2
,
,
,
.
,
3.
1
,
,
.
,
,
l
i
ngua
l on
t
o
l
ogy mapp
i
ng
,
Xl
i
ke
XLo
r
e
,
.
Xl
i
ke
,
,
、 、 、 、
,
.
XLo
r
e
,
、
.
:
1 )
3
,
(
,
,
Fu
[
76 ]
SOCOM
r
ende
r
i
ng );
2 )
ma
t
ch
i
ng );
3 )
(
[
73 ]
(
ma
t
ch
i
ng aud
i
t ),
.
Wang
.
[
77 ]
,
.
Nguye
,
a
l
i
t )
gnmen
.
,
、
(
c
r
os
s-
[
74 ]
、
、
,
85.
8% ,
,
88.
1%.
,
,
.
202 141
.
:
1 )
(
)
;
2 )
,
,
.
, Wang
.
[
78 ]
, Wang
[
75 ]
.
1 )
,
( Wi
k
iC
iKE ),
;
2 )
;
3 )
,
,
,
,
.
(
、
, Wi
k
iC
iKE
、
、
)
4
.
.
14. :
5 9 5
,
4
,
,
.
[
79 ]
Yao
、
,
、
)、
.
、
Goog
l
e Now ,
App
l
e S
i
r
i )
IBM Wa
t
son , Wo
l
f
r
am Al
pha
,
Fr
e
eba
s
e
(
,
);
,
(
(
;
Fr
e
eba
s
e
),
Fr
e
eba
s
e
,
.
,
(
,
;
,
.
Be
r
an
t
[
80 ]
Fr
e
eba
s
e
(
l
og
i
c
,
,
f
o
rm );
,
,
;
(
),
.
2 )
.
,
,
,
,
[
81 ]
Fade
r
,
.
( SPARQL ),
Fr
e
eba
s
e
,
.
Be
r
an
t
,
,
,
,
.
、
:“
“
,
,
Fr
e
eba
s
e
,
,
,
, (
,) (
[
82 ]
,
”,
,
Pr
oba
s
e
,
、
,
”,
.
,
,) ,
.
.
,
(
2.
4
5
) .
, 2012
,
,
2
,
,
,
,
.
2
.
,
Pa
r
a
l
ex
SEMPRE
J
a
c
ana-Fr
e
eba
s
e
②
;
③
,
1 )
t
t
c
ode.
l
e.
c
om ?
a
c
ana
① h
?
?
?
ps :
goog
p
j
t
t
t
a
l
l.
c
s.
wa
sh
i
ng
t
on.
edu
r
a
l
ex
② h
?
? knowi
?
p :
pa
t
t
l
s
t
an
f
o
r
d.
edu
s
o
f
twa
r
e
③ h
?
? www-n
?
p :
p.
:
:
①
(
i
n
f
o
rma
t
i
on r
e
t
r
i
eva
l )、
( na
t
ur
a
l l
anguage pr
oc
e
s
s
i
ng )、
( WWW )
(
a
r
t
i
f
i
c
i
a
l i
n
t
e
l
l
i
e )
genc
.
Knowl
edge Vau
l
t
Sa
t
o
r
i
,
15. 2016 , 53 (
3 )
5 9 6
,
.
1 )
.
,
,
(
、
、
6
)
,
、
、
,
.
( Web o
f documen
t )
, 1
,
、
( Web o
f
,
.
.
2 )
、
da
t
a ) .
,
,
.
,
,
.
,
,
.
.
、
、
,
,
.
(
、
、
)
,
,
.
,
,
,
,
.
.
3 )
.
,
.
:
、
、
.
,
[
1 ] Chr
i
s
t
i
an B , He
a
t
h T , Be
r
ne
r
s-Le
e T.L
i
nked da
t
a-t
he s
t
o
r
y
,
s
o f
a
r [
J ] .I
n
t
e
r
na
t
i
ona
l J
our
na
l on Seman
t
i
c Web and
I
n
f
o
rma
t
i
on Sys
t
ems , 2009 , 5 (
3 ): 1-22
[
2 ] Chen Xueq
i , J
i
n Xi
ao
l
ong , Wang Yuanzhuo , e
t a
l.Sur
vey on
.
,
b
i
t
a s
t
em and ana
l
t
i
c t
e
chno
l
ogy [
J ] .J
our
na
l o
f
g da
ys
y
,
So
f
twa
r
e , 2014 , 25 (
9 ): 1889-1908 (
i
n Ch
i
ne
s
e )
,
(
.
4 )
,
,
[
J ] .
,
,
,
.
, 2014 , 25 (
9 ): 1889-1908 )
[
3 ] Wang Yuanzhuo , J
i
a Yan
t
ao , L
i
u Dawe
i , e
t a
l.Open Web
i
ded i
n
f
o
rma
t
i
on s
e
a
r
ch and da
t
a mi
n
i
ng [
J ] .
knowl
edge a
,
.
J
our
na
l o
f Compu
t
e
r Re
s
e
a
r
ch and Deve
l
opmen
t , 2014 , 52
(
2 ): 456-474 (
i
n Ch
i
ne
s
e )
,
(
,
,
,
,
.
[
J ] .
.
5 )
、
,
, 2014 , 52 (
2 ): 456-
474 )
[
4 ] Cowi
e J , Lehne
r
t W. I
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on [
J ] .
,
Commun
i
c
a
t
i
ons o
f t
he ACM , 1996 , 39 (
1 ): 80-91
,
[
5 ] Ch
i
ncho
r N , Ma
r
sh E.Muc-7i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on t
a
sk
de
f
i
n
i
t
i
on [
C ] ?
o
c o
f t
he 7
t
h Me
s
s
age Unde
r
s
t
and
i
ng
? Pr
.
Con
f.Ph
i
l
ade
l
i
a : L
i
ngu
i
s
t
i
c Da
t
a Cons
o
r
t
i
um , 1998 : 359-
ph
,
,
,
SQL
,
.
,
,
367
[
6 ] Rau L F.Ex
t
r
a
c
t
i
ng c
ompany name
s f
r
om t
ex
t [
C ] ?
o
c o
f
? Pr
t
he 7
t
h IEEE Con
f on Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e App
l
i
c
a
t
i
ons.
genc
P
i
s
c
a
t
away , NJ : IEEE , 1991 : 29-32
16. :
5 9 7
[
7 ] L
i
u Xi
aohua , Zhang Shaod
i
an , We
i Fur
u , e
t a
l.Re
c
ogn
i
z
i
ng [
18 ] Zhang Yimi
ng , Zhou J F.A t
r
a
i
nab
l
e me
t
hod f
o
r ex
t
r
a
c
t
i
ng
named en
t
i
t
i
e
s i
n twe
e
t
s [
C ] ?
o
c o
f t
he 49
t
h Annua
l
? Pr Ch
i
ne
s
e en
t
i
t
s and t
he
i
r r
e
l
a
t
i
ons [
C ] ?
o
c o
f t
he 2nd
? Pr
y name
Me
e
t
i
ng o
f t
he As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s : Wo
rkshop on Ch
i
ne
s
e Language Con
unc
t
i
on wi
t
h t
he 38
t
h
j
Human Language Te
chno
l
og
i
e
s.S
t
r
oudsbur
g , PA : ACL , e
t
i
ng o
f t
he As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l
Annua
l Me
2011 : 359-367 L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g , PA : ACL , 2000 : 66-72
[
8 ] L
i
n Yi
f
eng , Ts
a
i Tz
onghan , Chou Wench
i , e
t a
l. A [
19 ] Banko M , Ca
f
a
r
e
l
l
a M J , Sode
r
l
and S , e
t a
l. Open
max
imum en
t
r
opy app
r
oa
ch t
o b
i
omed
i
c
a
l named en
t
i
t
y i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on f
o
r t
he Web [
C ] ?
o
c o
f t
he 20
t
h I
n
t
? Pr
r
e
c
ogn
i
t
i
on [
C ] ?
o
c o
f t
he 4
t
h ACM S
IGKDD Wo
rkshop
? Pr J
o
i
n
t Con
f on Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e. New Yo
rk : ACM ,
genc
on Da
t
a Mi
n
i
ng i
n B
i
o
i
n
f
o
rma
t
i
c
s.New Yo
rk : ACM , 2004 : 2007 : 2670-2676
56-61
[
20 ] Wu Fe
i , We
l
d D S. Open i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on us
i
ng
[
9 ] Sek
i
ne S , Sudo K , Noba
t
a C. Ex
t
ended named en
t
i
t
y
Wi
k
i
i
a [
C ] ?
o
c o
f t
he 48
t
h Annua
l Me
e
t
i
ng o
f t
he
? Pr
ped
h
i
e
r
a
r
chy [
C ] ?
o
c o
f t
he 3r
d Language Re
s
our
c
e
s and
? Pr As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s. S
t
r
oudsbur
g ,
f.New Yo
rk : Eur
ope
an Language Re
s
our
c
e
s
Eva
l
ua
t
i
on Con PA : ACL , 2010 : 118-127
As
s
o
c
i
a
t
i
on , 2002 : 1818-1824
[
21 ] Fade
r A , Sode
r
l
and S , Et
z
i
on
i O.I
den
t
i
f
i
ng r
e
l
a
t
i
ons f
o
r
y
[
10 ] L
i
ng Xi
ao , We
l
d D.S.F
i
ne-g
t
i
t
e
c
ogn
i
t
i
on [
C ] ?
r
a
i
ned en
?
y r open i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on [
C ] ?
o
c o
f t
he Con
f on
? Pr
Pr
o
c o
f t
he 26
t
h Con
f on As
s
o
c
i
a
t
i
on f
o
r t
he Advanc
emen
t o
f Emp
i
r
i
c
a
l Me
t
hods i
n Na
t
ur
a
l Language Pr
o
c
e
s
s
i
ng.
Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e.Men
l
o Pa
rk , CA : AAAI , 2012 : 94-
genc S
t
r
oudsbur
g , PA : ACL , 2011 : 1535-1545
100
[
22 ] Maus
am , Schmi
t
z M , Ba
r
t R , e
t a
l.Open l
anguage l
e
a
r
n
i
ng
[
11 ] Zhao Jun , L
i
u kang , Zhou Guangyou , e
t a
l. Open
i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on [
J ] .J
our
na
l o
f Ch
i
ne
s
e I
n
f
o
rma
t
i
on
6 ): 98-110 (
i
n Ch
i
ne
s
e )
Pr
o
c
e
s
s
i
ng , 2011 , 25 (
(
,
,
, .
Emp
i
r
i
c
a
l Me
t
hods i
n Na
t
ur
a
l Language Pr
o
c
e
s
s
i
ng and
Compu
t
a
t
i
ona
l Na
t
ur
a
l Language Le
a
r
n
i
ng.S
t
r
oudsbur
g ,
[
J ] .
, 2011 , 25 (
6 ): 98-110 )
[
12 ] Wh
i
t
e
l
aw C , Keh
l
enbe
ck A , Pe
t
r
ov
i
c N , e
t a
l.Web-s
c
a
l
e
named en
t
i
t
e
c
ogn
i
t
i
on [
C ] ?
o
c o
f t
he 17
t
h ACM Con
f
? Pr
y r
n
f
o
rma
t
i
on and Knowl
edge Managemen
t.New Yo
rk :
on I
ACM , 2008 : 123-132
[
13 ] J
a
i
n A , Penna
c
ch
i
o
t
t
i M.Open en
t
i
t
t
r
a
c
t
i
on f
r
om Web
y ex
s
e
a
r
ch que
r
ogs [
C ] ?
o
c o
f t
he 23r
d I
n
t Con
f on
? Pr
y l
Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g , PA : ACL , 2010 :
510-518
[
14 ] Kambha
t
l
a N.Comb
i
n
i
ng l
ex
i
c
a
l , s
t
a
c
t
i
c , and s
eman
t
i
c
yn
f
e
a
t
ur
e
s wi
t
h max
imum en
t
r
opy mode
l
s f
o
r ex
t
r
a
c
t
i
ng
r
e
l
a
t
i
ons [
C ] ?
o
c o
f t
he 42nd As
s
o
c
i
a
t
i
on f
o
r
? Pr
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
Compu
t
a
t
i
ona
l L
g , PA : ACL , 2004 :
PA : ACL , 2012 : 523-534
[
23 ] Banko M , Et
z
i
on
i O. The Tr
adeo
f
f
s be
twe
en open and
t
r
ad
i
t
i
ona
l r
e
l
a
t
i
on ex
t
r
a
c
t
i
on [
C ] ?
o
c o
f t
he As
s
o
c
i
a
t
i
on
? Pr
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s. S
t
r
oudsbur
f
o
r Compu
g , PA : ACL ,
2008 : 28-36
[
24 ] Zhu Jun , Ni
e Za
i
i
ang , L
i
u Xi
ao
i
ang , e
t a
l.S
t
a
t
Snowba
l
l : A
j
j
r
oa
ch t
o ex
t
r
a
c
t
i
ng en
t
i
t
e
l
a
t
i
onsh
i
C ] ?
s
t
a
t
i
s
t
i
c
a
l app
?
y r
ps [
Pr
o
c o
f t
he 18
t
h I
n
t Con
f on Wo
r
l
d Wi
de Web.New Yo
rk :
ACM , 2009 : 101-110
[
25 ] Al
an A , Al
exande
r L. Kr
akeN : N-a
r
a
c
t
s i
n open
y f
t
r
a
c
t
i
on [
C ] ?
o
c o
f t
he J
o
i
n
t Wo
rkshop on
i
n
f
o
rma
t
i
on ex
? Pr
Au
t
oma
t
i
c Knowl
edge Ba
s
e Cons
t
r
uc
t
i
on and Web-s
c
a
l
e
t
r
a
c
t
i
on.S
t
r
oudsbur
Knowl
edge Ex
g , PA : ACL , 2012 : 52-
56
1-22
[
15 ] L
i
u Keb
i
n , L
i Fang , L
i
u Le
i , e
t a
l.Imp
l
emen
t
a
t
i
on o
f a
ke
r
ne
l-ba
s
ed ch
i
ne
s
e r
e
l
a
t
i
on ex
t
r
a
c
t
i
on s
t
em [
J ] .J
our
na
l
ys
o
f Compu
t
e
r Re
s
e
a
r
ch and Deve
l
opmen
t , 2007 , 44 (
8 ):
,
,
,
[
26 ] McCa
l
l
um A.J
o
i
n
t i
n
f
e
r
enc
e f
o
r na
t
ur
a
l l
anguage p
r
o
c
e
s
s
i
ng
[
C ] ?
o
c o
f t
he 13
t
h Con
f on Compu
t
a
t
i
ona
l Na
t
ur
a
l
? Pr
a
r
n
i
ng.S
t
r
oudsbur
Language Le
g , PA : ACL , 2009 : 1
[
27 ] Guo J
i
any
i , L
i Zhen , Yu Zheng
t
ao , e
t a
l.Ex
t
r
a
c
t
i
on and
1406-1411 (
i
n Ch
i
ne
s
e )
(
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on [
C ] ?
o
c o
f t
he J
o
i
n
t Con
f on
f
o
r i
? Pr
f doma
i
n on
t
o
l
ogy c
onc
ep
t i
ns
t
anc
e ,
r
e
l
a
t
i
on p
r
ed
i
c
t
i
on o
.
[
J ] .
, 2007 , 44 (
8 ): 1406-1411 )
[
16 ] Ca
r
l
s
on A , Be
t
t
e
r
i
dge J , Wang R C , e
t a
l.Coup
l
ed s
emi-
supe
r
v
i
s
ed l
e
a
r
n
i
ng f
o
r i
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on [
C ] ?
o
c o
f
? Pr
a
t
t
r
i
bu
t
e and a
t
t
r
i
bu
t
e [
J ] .J
our
na
l o
f Nan
i
ng Un
i
ve
r
s
i
t
j
y :
Na
t
ur
a
l Sc
i
enc
e
s , 2012 , 48 (
4 ): 383-389 (
i
n Ch
i
ne
s
e )
(
,
,
,
d ACM I
n
t Con
f on Web Se
a
r
ch and Da
t
a Mi
n
i
ng.New
t
he 3r
、
.
[
J ] .
:
,
2012 , 48 (
4 ): 383-389 )
Yo
rk : ACM , 2010 : 101-110
[
17 ] Chen L
iwe
i , Feng Yans
ong , Zhao Dongyan. Ex
t
r
a
c
t
i
ng [
28 ] Suchanek F M , Ka
sne
c
i G , We
i
kum G.Yago : A c
o
r
e o
f
r
e
l
a
t
i
ons f
r
om t
he Web v
i
a we
ak
l
r
v
i
s
ed l
e
a
r
n
i
ng [
J ] .
y supe edge [
C ] ?
o
c o
f t
he 16
t
h I
n
t Con
f on Wo
r
l
d
s
eman
t
i
c knowl
? Pr
J
our
na
l o
f Compu
t
e
r Re
s
e
a
r
ch and Deve
l
opmen
t , 2013 , 50
(
9 ): 1825-1835 (
i
n Ch
i
ne
s
e )
(
f open da
t
a [
C ] ?
o
c o
f t
he 6
t
h I
n
t Seman
t
i
c Web
a Web o
? Pr
.
[
J ] .
Wi
de Web.New Yo
rk : ACM , 2007 : 697-706
[
29 ] Aue
r S , B
i
z
e
r C , Kob
i
l
a
r
ov G , e
t a
l.Dbped
i
a : A nuc
l
eus f
o
r
, 2013 , 50 (
9 ): 1825-1835 )
Con
f.Be
r
l
i
n : Sp
r
i
nge
r , 2007 : 722-735
17. 2016 , 53 (
3 )
5 9 8
[
30 ] Wu Fe
i , We
l
d D S.Au
t
onomous
l
eman
t
i
f
i
ng wi
k
i
i
a
y s
y
ped
[
C ] ?
o
c o
f t
he 16
t
h ACM Con
f on I
n
f
o
rma
t
i
on and
? Pr
Knowl
edge Managemen
t.New Yo
rk : ACM , 2007 : 41-50
[
31 ] Wang Yu , Tan Songbo , L
i
ao Xi
angwen , e
t a
l.Ex
t
r
a
c
t
ed
domi
n mode
l ba
s
ed named a
t
t
r
i
bu
t
e ex
t
r
a
c
t
i
on [
J ] .J
our
na
l o
f
s
e
a
r
ch and Deve
l
opmen
t , 2010 , 47 (
9 ): 1567-
Compu
t
e
r Re
1573 (
i
n Ch
i
ne
s
e )
(
,
,
[
43 ] Hobbs J R. Re
s
o
l
v
i
ng p
e
f
e
r
enc
e
s [
J ] . L
i
ngua ,
r
onoun r
1978 , 44 (
4 ): 311-338
[
44 ] Gr
o
s
z B J , We
i
ns
t
e
i
n S , J
o
sh
i A K.Cen
t
e
r
i
ng : A f
r
amewo
rk
f
o
r mode
l
i
ng t
he l
o
c
a
l c
ohe
r
enc
e o
f d
i
s
c
our
s
e [
J ] .
i
ngu
i
s
t
i
c
s , 1995 , 21 (
2 ): 203-225
Compu
t
a
t
i
ona
l L
[
45 ] Lapp
i
n S , Sha
l
om H J. An a
l
r
i
t
hm f
o
r p
r
onomi
na
l
go
e
s
o
l
u
t
i
on [
J ] .Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s , 1994 ,
anapho
r
a r
,
[
J ] .
.
20 (
4 ): 535-561
, 2010 , 47 (
9 ): 1567-1573 )
[
32 ] L
i Yang , Wang Ch
i , Han Fangq
i
u , e
t a
l.Mi
n
i
ng ev
i
denc
e
s
[
46 ] McCa
r
t
hy J F , Lehne
r
t W G. Us
i
ng de
c
i
s
i
on t
r
e
e
s f
o
r
e
s
o
l
u
t
i
on [
C ] ?
o
c o
f t
he 14
t
h I
n
t J
o
i
n
t Con
f
c
o
r
e
f
e
r
enc
e r
? Pr
t
i
t
i
s
amb
i
t
i
on [
C ] ?
o
c o
f t
he 19
t
h I
n
t
f
o
r named en
? Pr
y d
gua on Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e.San Fr
anc
i
s
c
o : Mo
r
fmann ,
genc
gan Kau
Con
f on Knowl
edge Di
s
c
ove
r
t
a Mi
n
i
ng.New Yo
rk :
y and Da 1995 : 1050-1055
ACM , 2013 : 1070-1078
[
47 ] Be
an D L , Ri
l
o
f
f E.Unsupe
r
v
i
s
ed l
e
a
r
n
i
ng o
f c
on
t
ex
t
ua
l r
o
l
e
[
33 ] Han Xi
anpe
i , Sun Le , Zhao Jun.Co
l
l
e
c
t
i
ve en
t
i
t
i
nk
i
ng i
n
y l
Web t
ex
t : A g
s
ed me
t
hod [
C ] ?
o
c o
f t
he 34
t
h I
n
t
r
aph-ba
? Pr
ACM Con
f on Re
s
e
a
r
ch and Deve
l
opmen
t i
n I
n
f
o
rma
t
i
on
Re
t
r
i
eva
l.New Yo
rk : ACM , 2011 : 765-774
i
ng t
he ve
c
t
o
r spa
c
e mode
l [
C ] ?
o
c o
f t
he
c
o
r
e
f
e
r
enc
i
ng us
? Pr
n
t Con
f on Compu
t
a
t
i
ona
l l
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
17
t
h I
g ,
PA : ACL , 1998 : 79-85
[
35 ] Pede
r
s
en T , Pur
anda
r
e A , Ku
l
ka
r
n
i A.Name d
i
s
c
r
imi
na
t
i
on
l
us
t
e
r
i
ng s
imi
l
a
r c
on
t
ex
t
s [
G ] ?
o
c o
f t
he 6
t
h I
n
t Con
f
by c
? Pr
on I
n
t
e
l
l
i
e
n
t T
e
x
t Pr
o
c
e
s
s
i
ng and Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.
g
Be
r
l
i
n :
Sp
r
i
nge
r , 2005 : 220-231
[
36 ] Ma
l
i
n B , Ai
r
o
l
d
i E , Ca
r
l
ey K.A ne
two
rk ana
l
i
s mode
l f
o
r
ys
d
i
s
amb
i
t
i
on o
f name
s i
n l
i
s
t
s [
J ] . Compu
t
a
t
i
ona
l &
gua
i
z
a
t
i
on Theo
r
2 ): 119-139
Ma
t
hema
t
i
c
a
l Or
gan
y , 2005 , 11 (
[
37 ] Han Xi
anpe
i , Zhao Jun.Named en
t
i
t
i
s
amb
i
t
i
on by
y d
gua
l
eve
r
ag
i
ng wi
k
i
i
a s
eman
t
i
c knowl
edge [
C ] ?
o
c o
f t
he
? Pr
ped
ACM
Con
f on I
n
f
o
rma
t
i
on and
Human Language Te
chno
l
og
i
e
s No
r
t
h Ame
r
i
c
an Chap
t
e
r o
f
t
he As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g ,
PA : ACL , 2004 : 297-304
[
34 ] Bagga A , Ba
l
dwi
n B. En
t
i
t
s
ed c
r
o
s
s-do
cumen
t
y-ba
18
t
h
knowl
edge f
o
r c
o
r
e
f
e
r
enc
e r
e
s
o
l
u
t
i
on [
C ] ?
o
c o
f t
he
? Pr
Knowl
edge
rk : ACM , 2009 : 215-224
Managemen
t.New Yo
[
38 ] Bune
s
cu R , Pa
s
c
a M. Us
i
ng enc
l
oped
i
c knowl
edge f
o
r
yc
named en
t
i
t
i
s
amb
i
t
i
on [
C ] ?
o
c o
f t
he Eur
ope
an
? Pr
y d
gua
Chap
t
e
r o
f t
he As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.
S
t
r
oudsbur
g , PA : ACL , 2006 : 9-16
[
39 ] Sen P. Co
l
l
e
c
t
i
ve c
on
t
ex
t-awa
r
e t
op
i
c mode
l
s f
o
r en
t
i
t
y
C ] ?
o
c o
f t
he 21s
t I
n
t Con
f on Wo
r
l
d
d
i
s
amb
i
t
i
on [
? Pr
gua
Wi
de Web.New Yo
rk : ACM , 2012 : 729-738
[
40 ] Shen We
i , Wang J
i
anyong , Luo P
i
ng , e
t a
l.L
i
nden : L
i
nk
i
ng
t
i
t
i
e
s wi
t
h knowl
edge ba
s
e v
i
a s
eman
t
i
c knowl
edge
named en
[
C ] ?
o
c o
f t
he 21s
t I
n
t Con
f on Wo
r
l
d Wi
de Web.New
? Pr
Yo
rk : ACM , 2012 : 449-458
[
41 ] Ra
t
i
nov L , Ro
t
h D , Downey D , e
t a
l.Lo
c
a
l and g
l
oba
l
a
l
r
i
t
hms f
o
r d
i
s
amb
i
t
i
on t
o wi
k
i
i
a [
C ] ?
o
c o
f t
he
? Pr
go
gua
ped
l Me
e
t
i
ng o
f t
he As
s
o
c
i
a
t
i
on f
o
r Compu
t
a
t
i
ona
l
49
t
h Annua
[
48 ] Tur
ney P.Mi
n
i
ng t
he Web f
o
r s
r
sus
ynonyms : PMI-IR ve
C ] ?
o
c o
f t
he 12
t
h Eur
ope
an Con
f on
LSA on TOEFL [
? Pr
Ma
ch
i
ne Le
a
r
n
i
ng.Be
r
l
i
n : Sp
r
i
nge
r , 2001 : 491-502
[
49 ] Cheng Tao , Lauw H W , Papa
r
i
z
o
s S.En
t
i
t
o
r
y s
ynonyms f
s
t
r
uc
t
ur
ed Web s
e
a
r
ch [
J ] .IEEE Tr
ans on Knowl
edge and
i
ne
e
r
i
ng , 2012 , 24 (
10 ): 1862-1875
Da
t
a Eng
[
50 ] Pan
t
e
l P , Cr
e
s
t
an E , Bo
rkov
sky A , e
t a
l. Web-s
c
a
l
e
d
i
s
t
r
i
bu
t
i
ona
l s
imi
l
a
r
i
t
t
i
t
e
t expans
i
on [
C ] ?
o
c o
f
? Pr
y and en
y s
f on Emp
i
r
i
c
a
l Me
t
hods i
n Na
t
ur
a
l Language
t
he 2009 Con
Pr
o
c
e
s
s
i
ng.S
t
r
oudsbur
g , PA : ACL , 2009 : 938-947
[
51 ] Chakr
aba
r
t
i K , Chaudhur
i S , Cheng Tao , e
t a
l. A
f
r
amewo
rk f
o
r r
obus
t d
i
s
c
ove
r
f en
t
i
t
C ] ?
?
y o
y s
ynonyms [
Pr
o
c o
f t
he 18
t
h ACM S
IGKDD I
n
t Con
f on Knowl
edge
Di
s
c
ove
r
t
a Mi
n
i
ng.New Yo
rk : ACM , 2012 : 1384-
y and Da
1392
[
52 ] De
shpande O , Lamba D S , Tour
n M , e
t a
l.Bu
i
l
d
i
ng ,
ma
i
n
t
a
i
n
i
ng , and us
i
ng knowl
edge ba
s
e
s : A r
epo
r
t f
r
om t
he
C ] ?
o
c o
f t
he 32nd ACM S
IGMOD I
n
t Con
f on
t
r
enche
s [
? Pr
f Da
t
a.New Yo
rk : ACM , 2013 : 1209-1220
Managemen
t o
[
53 ] Mende
s P N , Müh
l
e
i
s
en H , B
i
z
e
r C.S
i
eve : L
i
nked da
t
a
l
i
t
s
s
e
s
smen
t and f
us
i
on [
C ] ?
o
c o
f t
he 2nd I
n
t
? Pr
qua
y a
Wo
rkshop on L
i
nked Web Da
t
a Managemen
t a
t Ex
t
end
i
ng
Da
t
aba
s
e Te
chno
l
ogy.New Yo
rk : ACM , 2012 : 116-123
[
54 ] Sahoo S S , Ha
l
b W , He
l
lmann S , e
t a
l.A sur
vey o
f cur
r
en
t
o
r mapp
i
ng o
f r
e
l
a
t
i
ona
l da
t
aba
s
e
s t
o RDF [
R ] .
app
r
oa
che
s f
Cambr
i
dge , MA : The W3C RDB2RDF Wo
rk
i
ng Gr
oup ,
2009
[
55 ] Mi
che
l F , Mon
t
agna
t J , Fa
r
on-Zucke
r C.A sur
ve
f RDB
y o
L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g , PA : ACL , 2011 : 1375-1384 r
ans
l
a
t
i
on app
r
oa
che
s and t
oo
l
s [
R ] .Ni
c
e , Fr
anc
e :
t
o RDF t
[
42 ] Ochs C , Ti
an T , Ge
l
l
e
r J , e
t a
l.Goog
l
e knows who i
s I
n
f
o
rma
t
i
c
s , S
i
l
s & Sys
t
ems Lab (
I
3S ), Un
i
ve
r
s
i
t
f
gna
y o
oday — bu
i
l
d
i
ng an on
t
o
l
ogy f
r
om s
e
a
r
ch eng
i
ne
f
amous t
i
a An
t
i
l
i
s , 2014
Ni
c
e-Soph
po
knowl
edge and DBped
i
a [
C ] ?
o
c o
f t
he 5
t
h IEEE I
n
t Con
f
? Pr [
56 ] S
t
ude
r R , Ben
ami
ns V R , Fens
e
l D. Knowl
edge
j
t
i
c Compu
t
i
ng.P
i
s
c
a
t
away , NJ : IEEE , 2011 : 320-
on Seman i
nc
i
l
e
s and me
t
hods [
J ] .Da
t
a & Knowl
edge
eng
i
ne
e
r
i
ng : Pr
p
327 Eng
i
ne
e
r
i
ng , 1998 , 25 (
1 ): 161-197
18. :
5 9 9
[
57 ] Wong W , L
i
u We
i , Bennamoun M.On
t
o
l
ogy l
e
a
r
n
i
ng f
r
om [
70 ] Lu Dao
she , Yang Sh
i
han , Wu J
i
nzhao , e
t a
l.
t
ex
t : A l
ook ba
ck and i
n
t
o t
he f
u
t
ur
e [
J ] .ACM Compu
t
i
ng I
n
t
e
r
d
i
s
c
i
l
i
na
r
e
a
s
on
i
ng on de
s
c
r
i
t
i
on l
og
i
c [
J ] .J
our
na
l
p
y r
p
Sur
veys , 2012 , 44 (
4 ): 20123915468506 o
f App
l
i
c
a
t
i
on Re
s
e
a
r
ch o
f Compu
t
e
r
s , 2013 , 29 (
12 ): 4503-
[
58 ] Wu Wen
t
ao , L
i Hongs
ong , Wang Ha
i
xun , e
t a
l.Pr
oba
s
e : A
axonomy f
o
r t
ex
t unde
r
s
t
and
i
ng [
C ] ?
o
c o
f
r
obab
i
l
i
s
t
i
c t
? Pr
p
t
he 31s
t ACM S
IGMOD I
n
t Con
f on Managemen
t o
f Da
t
a.
450 (
i
n Ch
i
ne
s
e )
(
,
,
[
J ] .
,
.
, 2013 , 29 (
12 ): 4503-4506 )
[
71 ] Dong Xi
n , Gabr
i
l
ov
i
ch E , He
i
t
z G , e
t a
l.Knowl
edge vau
l
t :
New Yo
rk : ACM , 2012 : 481-492
[
59 ] Sh
i Shumi
ng. Au
t
oma
t
i
c and s
emi-au
t
oma
t
i
c knowl
edge
A Web-s
c
a
l
e app
r
oa
ch t
o p
r
obab
i
l
i
s
t
i
c knowl
edge f
us
i
on
ex
t
r
a
c
t
i
on [
J ] .Commun
i
c
a
t
i
ons o
f t
he CCF , 2013 , 9 (
8 ): [
C ] ?
o
c o
f t
he 20
t
h I
n
t Con
f on Knowl
edge Di
s
c
ove
r
? Pr
y and
65-73 ( i
n Ch
i
ne
s
e ) n
i
ng.New Yo
rk : ACM , 2014 : 601-610
Da
t
a Mi
(
[
72 ] Tan C H , Ag
i
ch
t
e
i
n E , I
i
r
o
t
i
s P , e
t a
l.Tr
us
t , bu
t ve
r
i
f
pe
y :
[
J ] .
.
, 2013 , 9 (
8 ): 65-73 )
Pr
ed
i
c
t
i
ng c
on
t
r
i
bu
t
i
on qua
o
r knowl
edge ba
s
e
l
i
t
y f
[
60 ] Ha
r
r
i
s Z S.Di
s
t
r
i
bu
t
i
ona
l s
t
r
uc
t
ur
e [
J ] .Wo
r
d , 1954 , 10
(
23 ): 146-162
c
ons
t
r
uc
t
i
on and cur
a
t
i
on [
C ] ?
o
c o
f t
he 7
t
h ACM I
n
t
? Pr
a
r
ch and Da
t
a Mi
n
i
ng.New Yo
rk : ACM ,
Con
f on Web Se
[
61 ] Zeng Yi , Wang Dongsheng , Zhang Ti
e
l
i
n , e
t a
l.CAS
IA-
2014 : 553-562
KB : A mu
l
t
i-s
our
c
e ch
i
ne
s
e s
eman
t
i
c knowl
edge ba
s
e bu
i
l
t [
73 ] Wang Zh
i
i Juanz
i , Wang Zh
i
chun , e
t a
l.XLo
r
e : A
gang , L
f
r
om s
t
r
uc
t
ur
ed and uns
t
r
uc
t
ur
ed Web da
t
a [
G ] ?
t
i
c
? Seman l
a
r
r
aph [
c
a
l
e eng
l
i
sh-ch
i
ne
s
e b
i
l
i
ngua
l knowl
edge g
C ] ?
?
ge-s
Te
chno
l
ogy.Be
r
l
i
n : Sp
r
i
nge
r , 2014 : 75-88 Pr
o
c o
f t
he 12
t
h I
n
t Seman
t
i
c Web Con
f.New Yo
rk : ACM ,
[
62 ] Wang Zh
i
i Juanz
i , L
i Shuang
i
e , e
t a
l.Cr
o
s
s-l
i
ngua
l
gang , L
j
2013 : 121-124
knowl
edge va
l
i
da
t
i
on ba
s
ed t
axonomy de
r
i
va
t
i
on f
r
om [
74 ] Nguyen T , Mo
r
e
i
r
a V , Nguyen H , e
t a
l. Mu
l
t
i
l
i
ngua
l
he
t
e
r
ogeneous on
l
i
ne wi
k
i
s [
C ] ?
o
c o
f t
he 28
t
h Con
f on
? Pr s
chema ma
t
ch
i
ng f
o
r wi
k
i
i
a i
n
f
oboxe
s [
J ] . The
ped
Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e.Men
l
o Pa
rk , CA : AAAI , 2014 : 180-
genc
f t
he VLDB Endowmen
t , 2011 , 5 (
2 ): 133-144
Pr
o
c
e
ed
i
ngs o
[
75 ] Wang Zh
i
i Zh
i
x
i
ng , L
i Juanz
i , e
t a
l. Tr
ans
f
e
r
gang , L
186
[
63 ] Wang Ch
i , Dan
i
l
ev
sky M , De
s
a
i N , e
t a
l.A phr
n
i
ng
a
s
e mi s
ed c
r
o
s
s-l
i
ngua
l knowl
edge ex
t
r
a
c
t
i
on f
o
r
l
e
a
r
n
i
ng ba
f
r
amewo
rk f
o
r r
e
cur
s
i
ve c
ons
t
r
uc
t
i
on o
f a t
op
i
c
a
l h
i
e
r
a
r
chy wi
k
i
i
a [
C ] ?
o
c o
f t
he 51s
t Annua
l Me
e
t
i
ng o
f t
he
? Pr
ped
[
C ] ?
o
c o
f t
he 19
t
h ACM S
IGKDD I
n
t Con
f on Knowl
edge
? Pr o
r Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s. S
t
r
oudsbur
As
s
o
c
i
a
t
i
on f
g ,
Di
s
c
ove
r
t
a Mn
i
ng.New Yo
rk : ACM , 2013 : 437-
y and Da
PA : ACL.2013 : 641-650
[
76 ] Fu B , Br
ennan R , De
c
l
an O S. Cr
o
s
s-l
i
ngua
l on
t
o
l
ogy
445
[
64 ] L
i
u Xueq
i
ng , Song Yangq
i
u , L
i
u Sh
i
x
i
a , e
t a
l.Au
t
oma
t
i
c mapp
i
ng and i
t
s us
e on t
he mu
l
t
i
l
i
ngua
l s
eman
t
i
c Web [
C ] ?
?
ons
t
r
uc
t
i
on f
r
om keywo
r
ds [
C ] ?
o
c o
f t
he 18
t
h
t
axonomy c
? Pr Pr
o
c o
f t
he 1s
t Wo
rkshop on t
he Mu
l
t
i
l
i
ngua
l Seman
t
i
c Web ,
ACM S
IGKDD I
n
t Con
f on Knowl
edge Di
s
c
ove
r
t
a
y and Da a
t t
he 19
t
h I
n
t Wo
r
l
d Wi
de Web Con
f ( WWW 2010 ) .
rk : ACM , 2012 : 1433-1441
Mi
n
i
ng.New Yo
Ti
l
bur
t
he
r
l
ands : CEUR-WS , 2010 : 13-20
g , Ne
[
65 ] Le
e T W , Lewi
ck
i M S , Gi
r
o
l
ami M , e
t a
l.B
l
i
nd s
our
c
e [
77 ] Wang Zh
i
chun , L
i Juanz
i , Wang Zh
i
t a
l.Cr
o
s
s-
gang , e
f mo
r
e s
our
c
e
s t
han mi
x
t
ur
e
s us
i
ng ove
r
c
omp
l
e
t
e
s
epa
r
a
t
i
on o l
i
ngua
l knowl
edge l
i
nk
i
ng a
c
r
o
s
s wi
k
i knowl
edge ba
s
e
s [
C ] ?
?
r
ep
r
e
s
en
t
a
t
i
ons [
J ] .S
i
l Pr
o
c
e
s
s
i
ng Le
t
t
e
r
s , 1999 , 6 (
4 ):
gna Pr
o
c o
f t
he 21s
t I
n
t Con
f on Wo
r
l
d Wi
de Web.New Yo
rk :
ACM , 2012 : 459-468
87-90
[
66 ] Lu Shaoyuan , Hsu K H , Kuo L
i
i
ng.A s
eman
t
i
c s
e
r
v
i
c
e
j [
78 ] Wang Zh
i
chun , L
i Juanz
i , Tang J
i
e.Boo
s
t
i
ng c
r
o
s
s-l
i
ngua
l
r
oa
ch ba
s
ed on wo
r
dne
t and SWRL r
u
l
e
s [
C ] ?
ma
t
ch app
? i
nk
i
ng v
i
a c
onc
ep
t anno
t
a
t
i
on [
C ] ?
o
c o
f t
he
knowl
edge l
? Pr
Pr
o
c o
f t
he 10
t
h IEEE I
n
t Con
f on E-Bus
i
ne
s
s Eng
i
ne
e
r
i
ng. 23r
d I
n
t J
o
i
n
t Con
f on Ar
t
i
f
i
c
i
a
l I
n
t
e
l
l
i
e.Men
l
o Pa
rk ,
genc
CA : AAAI , 2013 : 2733-2739
P
i
s
c
a
t
away , NJ : IEEE , 2013 : 419-422
[
67 ] So
che
r R , Chen Dand
i , Mann
i
ng C D , e
t a
l.Re
a
s
on
i
ng wi
t
h [
79 ] Yao Xuchen , Ben
ami
n V D.I
n
f
o
rma
t
i
on ex
t
r
a
c
t
i
on ove
r
j
neur
a
l t
ens
o
r ne
two
rks f
o
r knowl
edge ba
s
e c
omp
l
e
t
i
on [
C ] ?
? s
t
r
uc
t
ur
ed da
t
a : que
s
t
i
on answe
r
i
ng wi
t
h f
r
e
eba
s
e [
C ] ?
o
c
? Pr
Pr
o
c o
f Neur
a
l I
n
f
o
rma
t
i
on Pr
o
c
e
s
s
i
ng Sys
t
ems.Nevada , o
f t
he 52nd Annua
l Me
e
t
i
ng o
f t
he As
s
o
c
i
a
t
i
on f
o
r
Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g , PA : ACL , 2014 :
USA : NIPS , 2013 : 926-934
[
68 ] Lao Ni , Mi
t
che
l
l T , Cohen W W.Random wa
l
k i
n
f
e
r
enc
e
956-966
e
a
r
n
i
ng i
n a l
a
r
c
a
l
e knowl
edge ba
s
e [
C ] ?
o
c o
f t
he
and l
? Pr
ge s [
80 ] Be
r
an
t J , L
i
ang P.Seman
t
i
c pa
r
s
i
ng v
i
a pa
r
aphr
a
s
i
ng [
C ] ?
?
Con
f on Emp
i
r
i
c
a
l Me
t
hods i
n Na
t
ur
a
l Language Pr
o
c
e
s
s
i
ng. Pr
o
c o
f t
he 52nd Annua
l Me
e
t
i
ng o
f t
he As
s
o
c
i
a
t
i
on f
o
r
Compu
t
a
t
i
ona
l L
i
ngu
i
s
t
i
c
s.S
t
r
oudsbur
g , PA : ACL , 2014 :
S
t
r
oudsbur
g , PA : ACL , 2011 : 529-539
[
69 ] Yang L
i , Hu Shour
en. Knowl
edge ba
s
e i
n
f
e
r
enc
e and
ma
i
n
t
a
i
n s
t
em [
J ] . J
our
a
l o
f Na
t
i
ona
l Un
i
ve
r
s
i
t
f
ys
y o
De
f
ens
e Te
chno
l
ogy , 1991 , 13 (
2 ): 127-133 (
i
n Ch
i
ne
s
e )
(
,
.
, 1991 , 13 (
2 ): 127-133 )
(
KBIMS )[
J ] .
1415-1425
[
81 ] Fade
r A , Ze
t
t
l
emoye
r L , Et
z
i
on
i O. Open que
s
t
i
on
r cur
a
t
ed and ex
t
r
a
c
t
ed knowl
edge ba
s
e
s [
C ] ?
answe
r
i
ng ove
?
Pr
o
c o
f t
he 20
t
h ACM I
n
t Con
f on Knowl
edge Di
s
c
ove
r
y and
n
i
ng.New Yo
rk : ACM , 2014 : 1156-1165
Da
t
a Mi
19. 2016 , 53 (
3 )
6 0 0
[
82 ] Be
r
an
t J , Chou A , Fr
o
s
t
i
t a
l.Seman
t
i
c pa
r
s
i
ng on
g R , e
Duan Hong , bo
r
n i
n 1974. Ma
s
t
e
r ,
f
r
e
eba
s
e f
s
t
i
on-answe
i
r
s [
r
om que
r pa
C ] ?
o
c o
f t
he Con
f
? Pr l
e
c
t
u
r
e
r. Hi
s ma
i
n r
e
s
e
a
r
ch i
n
t
e
r
e
s
t
s
on Emp
i
r
i
c
a
l Me
t
hods i
n Na
t
ur
a
l Language Pr
o
c
e
s
s
i
ng. i
nc
l
ude ma
ch
i
ne l
e
a
r
n
i
ng and da
t
a mi
n
i
ng ,
S
t
r
oudsbur
g , PA : ACL , 2013 : 1533-1544 na
t
u
r
a
l l
anguage p
r
o
c
e
s
s
i
ng , and s
o
c
i
a
l
L
i
u Qi
r
n i
n 1974.PhD , a
s
s
o
c
i
a
t
e
a
o , bo
r
o
f
e
s
s
o
r. Membe
r o
f Ch
i
na Compu
t
e
r
p
Fede
r
a
t
i
on. Hi
s ma
i
n r
e
s
e
a
r
ch i
n
t
e
r
e
s
t
s
i
nc
l
ude ma
ch
i
ne l
e
a
r
n
i
ng , da
t
a mi
n
i
ng ,
r
o
c
e
s
s
i
ng , and s
na
t
u
r
a
l l
anguage p
o
c
i
a
l
l
s
i
s.
ne
two
r
k ana
y
L
i Yang , bo
r
n i
n 1990.Ma
s
t
e
r , s
t
uden
t
f Ch
i
na Compu
t
e
r Fede
r
a
t
i
on.
membe
r o
Hi
s ma
i
n r
e
s
e
a
r
ch i
n
t
e
r
e
s
t
s i
nc
l
ude
r
aph , ma
knowl
edge g
ch
i
ne l
e
a
r
n
i
ng and
r
o
c
e
s
s
i
ng (
anguage p
keda
shq
s@
na
t
u
r
a
l l
163.
c
om ) .
ne
two
r
k ana
l
s
i
s (
dhp
r
o @s
i
na.
c
om ) .
y
L
i
u Ya
o , bo
r
n i
n 1978.PhD , l
e
c
t
u
r
e
r.
Membe
r o
f Ch
i
na Compu
t
e
r Fede
r
a
t
i
on.
i
n r
e
s
e
a
r
ch i
n
t
e
r
e
s
t
s i
nc
l
ude s
o
c
i
a
l
He
r ma
n
a
l
s
i
s , d
a
t
a mi
n
i
ng , a
nd n
e
two
r
k
n
e
two
r
k a
y
me
a
su
r
emen
t (
l
i
uyao @ue
s
t
c.
edu.
cn ) .
n Zh
i
r
n i
n 1956. PhD ,
Qi
guang , bo
r o
f Ch
i
na
r
o
f
e
s
s
o
r. Sen
i
o
r membe
p
Compu
t
e
r Fede
r
a
t
i
on.Hi
s ma
i
n r
e
s
e
a
r
ch
i
n
t
e
r
e
s
t
s i
nc
l
ude i
n
f
o
rma
t
i
on s
e
cu
r
i
t
y and
i
n
z
mob
i
l
e c
ompu
t
i
ng (
s
t
c.
edu.
cn ) .
q
g @ue