INDEX
Explanations
linguistic patterns and syntactic structures within textual content
New Auto-Interp
Negative Logits
Residence
-0.15
¬ģ
-0.15
cel
-0.14
residence
-0.14
he
-0.14
ÙĪØ§Øª
-0.13
kontakte
-0.13
лоÑĩ
-0.13
èĻŁ
-0.13
Locker
-0.13
POSITIVE LOGITS
Horton
0.16
dum
0.16
PointerType
0.15
аÑĤо
0.15
ards
0.14
ätt
0.14
orna
0.14
agnost
0.14
Dro
0.14
orta
0.14
Activations Density 0.008%