INDEX
Explanations
instances of punctuation and special characters, specifically periods and parentheses
New Auto-Interp
Negative Logits
-0.56
in
-0.53
ment
-0.51
fram
-0.51
2
-0.50
5
-0.49
ավ
-0.47
(
-0.47
opp
-0.47
————————————————
-0.46
POSITIVE LOGITS
resourceCulture
1.14
$.}
1.13
'].'
1.06
kasarigan
1.05
__).
1.02
()].
0.97
`.
0.96
للمعارف
0.96
).}
0.94
ⓧ
0.94
Activations Density 0.939%