Explanations
adjectives or adjectival phrases with strong connotations
expressions and phrases related to warnings or caution
Negative Logits
Token        Logit
assemblies   -0.81
Norn         -0.77
Annotations  -0.73
ngth         -0.71
isine        -0.71
ancies       -0.67
atography    -0.65
ovies        -0.64
idas         -0.63
arettes      -0.63
Positive Logits
Token        Logit
nightmare    0.78
leg          0.74
gap          0.71
kill         0.68
fest         0.65
tactic       0.65
deterrent    0.64
cat          0.64
con          0.64
take         0.63
Activations Density: 0.473%