INDEX
Explanations
phrases related to official statements or announcements
references to formal statements or announcements
New Auto-Interp
Negative Logits
upiter
-0.90
avorite
-0.79
existent
-0.79
rys
-0.78
elsius
-0.72
cephal
-0.72
estern
-0.70
cffff
-0.70
opes
-0.69
rowd
-0.68
POSITIVE LOGITS
statement
1.13
Statement
0.97
statements
0.95
warr
0.93
soType
0.85
gow
0.82
encour
0.76
Statement
0.75
ariat
0.74
goodbye
0.73
Activations Density 0.035%