INDEX
Explanations
phrases related to legal or criminal activities, investigations, and security breaches
references to significant events, conditions, or issues impacting society
New Auto-Interp
Negative Logits
isSpecialOrderable
-0.64
Secondly
-0.63
zan
-0.59
comprom
-0.57
},{"-0.55
underestimated
-0.55
Shades
-0.53
ensued
-0.53
secondly
-0.53
=================================
-0.52
POSITIVE LOGITS
of
1.03
of
0.80
OF
0.80
liest
0.77
fortunes
0.73
workings
0.72
forts
0.68
inations
0.66
Of
0.65
characteristics
0.63
Activations Density 0.497%