INDEX
Explanations
special characters
statements about significant events or conditions
Negative Logits
manship         -0.63
aditional       -0.60
precaution      -0.60
boro            -0.60
unemploy        -0.59
ceremonial      -0.58
tremend         -0.58
favourite       -0.58
squats          -0.58
civilisation    -0.58
Positive Logits
âĢ              1.17
̶               1.11
³³³³            1.10
ãĢ              1.05
³³³             1.01
³³³³³³³³        0.98
âĢł             0.96
îĢ              0.96
*/              0.95
âĹı             0.95
Activations Density: 0.543%