INDEX
Explanations
references to medical conditions or terminology, especially those related to body parts or processes
instances of the word "od" and its variations
New Auto-Interp
Negative Logits
Ago
-0.67
Leone
-0.62
_>
-0.62
vant
-0.61
kson
-0.60
Ana
-0.60
Faul
-0.57
Ki
-0.57
UGE
-0.55
Has
-0.55
POSITIVE LOGITS
yssey
1.35
ragon
1.04
rome
1.03
iox
1.03
iamond
1.00
iversity
0.99
sworth
0.96
iazep
0.94
ouble
0.93
iscover
0.93
Activations Density 0.035%