Explanations
phrases that indicate leadership or guidance within a group or organization
Negative Logits
ainfi           -0.93
AssemblyTitle   -0.92
itſelf          -0.88
myſelf          -0.87
Jefus           -0.86
ſtate           -0.81
Balzac          -0.81
dévelo          -0.80
rungsseite      -0.78
auffi           -0.77
Positive Logits
bờ              0.53
new             0.49
a               0.44
“               0.41
'][]            0.41
tem             0.40
They            0.40
tôn             0.40
rot             0.40
NEW             0.40
Activations Density 0.650%