INDEX
Explanations
phrases indicating large-scale impacts or events
New Auto-Interp
Negative Logits
مشين
-0.69
gql
-0.69
bu
-0.68
relations
-0.68
l
-0.65
eficent
-0.65
m
-0.65
perdana
-0.64
conf
-0.64
w
-0.63
POSITIVE LOGITS
myſelf
1.22
Jefus
1.20
Monfieur
1.13
themſelves
1.11
purpoſe
1.07
ſmall
1.07
himſelf
1.07
raiſ
1.06
ſeveral
1.04
ſche
1.03
Activations Density 0.156%