INDEX
Explanations
mentions of professional qualifications and affiliations
New Auto-Interp
Negative Logits
ikat
-0.16
plash
-0.15
etter
-0.15
hevik
-0.15
addock
-0.15
ottle
-0.14
terk
-0.14
odka
-0.14
Kaf
-0.14
aket
-0.14
POSITIVE LOGITS
cion
0.16
Tate
0.14
status
0.14
Stra
0.14
conde
0.13
vers
0.13
rowable
0.13
paved
0.13
dia
0.13
Tray
0.13
Activations Density 0.034%