INDEX
Explanations
words related to legal entities or individuals' names
mentions of a specific individual named "Card" or variations of the name
Negative Logits
¿½        -1.02
hower     -0.79
Ń·        -0.73
ĸļ        -0.73
merce     -0.72
á½        -0.71
ithing    -0.70
%]        -0.67
Siem      -0.66
akings    -0.65
Positive Logits
iovascular    1.38
inals         1.23
iologist      1.15
iac           1.15
ozo           1.10
assian        1.06
igans         1.01
iov           1.00
board         1.00
igan          0.99
Activations Density 0.018%