INDEX
Explanations
occurrences of the word "the"
New Auto-Interp
Negative Logits
purpoſe
-0.83
^(@)
-0.83
Majefty
-0.80
hanem
-0.80
avoient
-0.79
للمعارف
-0.78
themſelves
-0.77
étoient
-0.74
greateſt
-0.74
Хьажоргаш
-0.74
POSITIVE LOGITS
awtextra
0.74
pageable
0.60
THE
0.59
den
0.57
kie
0.57
itemize
0.57
THE
0.57
appcompat
0.56
COUVER
0.56
η
0.53
Activations Density 0.191%