INDEX
Explanations
mentions or references to presence or absence
mentions of "presence"
Negative Logits
imb        -0.85
initions   -0.70
cipled     -0.69
bard       -0.69
ifiable    -0.68
DIT        -0.67
aired      -0.66
endiary    -0.65
YR         -0.65
strap      -0.64
Positive Logits
presence   1.07
Presence   0.87
enance     0.78
uated      0.75
idon       0.72
terday     0.68
luster     0.67
abroad     0.66
alan       0.65
uality     0.65
Activations Density 0.019%