INDEX
Explanations
adjectives and nouns related to intense or dramatic situations
hyperbolic expressions related to social or political commentary
New Auto-Interp
Negative Logits
uilt
-0.61
Created
-0.61
itamin
-0.60
ayn
-0.60
SPONSORED
-0.60
roma
-0.59
Located
-0.59
Selected
-0.57
Occupations
-0.57
Facilities
-0.56
POSITIVE LOGITS
understatement
1.01
caution
0.99
guesses
0.91
chuckle
0.89
caveats
0.87
irony
0.86
disclaimer
0.83
caveat
0.83
cynicism
0.80
bole
0.80
Activations Density 0.954%