INDEX
Explanations
words related to negative outcomes or events
terms related to disastrous events or situations
New Auto-Interp
Negative Logits
trak
-0.85
etch
-0.80
bors
-0.77
ramid
-0.77
uni
-0.75
pel
-0.75
gat
-0.75
hung
-0.74
agos
-0.72
annis
-0.72
POSITIVE LOGITS
havoc
1.00
itous
0.98
ãĥ¼ãĥĨ
0.89
disastrous
0.85
adolesc
0.79
ly
0.78
ously
0.75
ãĥ¥
0.74
catastrophic
0.71
consequences
0.71
Activations Density 0.009%