INDEX
Explanations
statements related to events, announcements, and actions taken by organizations or entities
Negative Logits
myſelf         -0.92
himſelf        -0.86
purpoſe        -0.71
fubject        -0.71
Monfieur       -0.69
personlig      -0.67
myself         -0.66
personally     -0.66
personalmente  -0.66
ſeveral        -0.65
Positive Logits
its     1.58
itself  1.43
Its     1.30
Its     1.26
itself  1.15
its     1.09
Itself  1.05
它的    0.92
在其    0.81
яке     0.77
Activations Density 0.680%