INDEX
Explanations
language indicating a sense of urgency or seriousness, particularly related to a negative situation
instances of the word "dire" indicating urgent or severe situations
Negative Logits
obbies     -0.84
adesh      -0.81
nesota     -0.79
ertodd     -0.77
adding     -0.75
imately    -0.74
į          -0.73
andise     -0.72
ACP        -0.72
fixme      -0.71
Positive Logits
wolf       0.96
gency      0.91
wolves     0.88
ly         0.87
stal       0.85
dire       0.81
bly        0.80
sted       0.78
lin        0.78
cci        0.77
Activations Density: 0.011%