INDEX
Explanations
prepositions or phrases indicating extent or degree
prepositions and phrases indicating relationships or connections
New Auto-Interp
Negative Logits
ENTS
-0.62
aeda
-0.61
antly
-0.59
apolis
-0.56
DEN
-0.55
arch
-0.55
addafi
-0.55
OV
-0.52
stract
-0.50
ablishment
-0.49
POSITIVE LOGITS
which
2.13
which
1.83
whom
1.58
Which
1.53
Which
1.42
whose
1.38
whose
1.26
whence
1.25
wherein
1.11
whereby
0.93
Activations Density 0.953%