INDEX
Explanations
mentions of scientific research institutions or their affiliated individuals
occurrences of the substring "alk" in words
New Auto-Interp
Negative Logits
ples
-0.74
============
-0.71
ACTED
-0.70
ODUCT
-0.70
CES
-0.68
======
-0.67
================================================================
-0.67
éĹ
-0.65
cus
-0.64
PDATE
-0.62
POSITIVE LOGITS
enstein
1.06
owitz
1.04
enburg
1.03
iewicz
0.95
reath
0.92
rish
0.92
ipedia
0.90
alk
0.88
edIn
0.88
ers
0.85
Activations Density 0.029%