INDEX
Explanations
references to governmental decisions and development processes
Negative Logits
-0.84  \&
-0.83  &
-0.81  Honorable
-0.77  Honorable
-0.75  Whilst
-0.74  FUCKING
-0.74  Whilst
-0.72  हाँ
-0.71  (&
-0.71  noël
Positive Logits
0.96
0.89  ,’’
0.84  ‘‘
0.77  ''
0.74  ���
0.73  ,''
0.68  .—
0.67  —
0.67  --
0.65  ``
Activations Density 0.141%