INDEX
Explanations
instances of structured data presentation, such as lists and sections in a document
Negative Logits
myſelf        -1.16
itſelf        -1.00
Monfieur      -0.95
themſelves    -0.92
ſeveral       -0.92
Jefus         -0.92
himſelf       -0.88
auffi         -0.85
ſelf          -0.84
―――――         -0.84
Positive Logits
^             1.60
^^            0.96
^             0.96
^-            0.91
↑             0.82
^*            0.75
^'            0.71
AccessorTable 0.70
שוליים        0.69
^.            0.69
Activations Density 0.060%