INDEX
Explanations
pronouns indicating possession or ownership
New Auto-Interp
Negative Logits
itus
-0.16
389
-0.15
526
-0.15
697
-0.15
528
-0.15
689
-0.15
illus
-0.14
375
-0.14
397
-0.14
íݸ
-0.14
POSITIVE LOGITS
/
0.18
ByKey
0.14
materials
0.14
ught
0.14
(!!
0.14
Sabb
0.14
Birthday
0.13
нок
0.13
|
0.13
industrial
0.13
Activations Density 0.000%