INDEX
Explanations
words related to strength, stability, and certainty
the word "firm" in various contexts
New Auto-Interp
Negative Logits
ĺħ
-0.83
NF
-0.76
vernment
-0.74
Emin
-0.73
DragonMagazine
-0.70
_>
-0.69
Journal
-0.68
Alley
-0.67
Jackson
-0.67
RIP
-0.66
POSITIVE LOGITS
ament
1.06
ness
0.90
footing
0.88
handshake
0.84
nesses
0.80
anyahu
0.78
tle
0.77
firm
0.77
aton
0.76
urbed
0.74
Activations Density 0.011%