INDEX
Explanations
names of people and their titles or affiliations
New Auto-Interp
Negative Logits
endif
-0.17
agnost
-0.15
athe
-0.15
ochrome
-0.15
prox
-0.15
pla
-0.14
žÃŃ
-0.14
placement
-0.14
elage
-0.14
nod
-0.14
POSITIVE LOGITS
ongan
0.17
autof
0.15
knull
0.14
ãĤ·ãĥ§ãĥ³
0.13
LIBINT
0.13
Permission
0.13
]âĢı
0.13
Reporter
0.13
iceps
0.13
å¹¹
0.13
Activations Density 0.516%