INDEX
Explanations
mentions of people being elected into political office, especially when followed by other positive political terms
New Auto-Interp
Negative Logits
يتيمه
-0.86
zelve
-0.79
дописавши
-0.78
étoit
-0.77
resourceCulture
-0.76
bootstrapcdn
-0.74
समीक्षक
-0.73
AutoScaleMode
-0.71
légende
-0.71
ligiloj
-0.70
POSITIVE LOGITS
<bos>
1.40
↵↵
0.86
'
0.85
’
0.64
The
0.56
A
0.56
'./
0.55
You
0.54
In
0.53
<eos>
0.51
Activations Density 0.526%