INDEX
Explanations
phrases related to conflict or competition
words related to being trapped or constrained
Negative Logits
Token       Logit
Clinic      -0.63
paraly      -0.62
Patterson   -0.59
antit       -0.58
Subscribe   -0.56
OUN         -0.54
extent      -0.53
KN          -0.53
iod         -0.53
annabin     -0.52
Positive Logits
Token       Logit
ged         4.57
ging        2.99
ges         2.36
gers        2.02
ger         1.93
ge          1.82
gered       1.77
gement      1.76
gery        1.64
gence       1.62
Activations Density: 0.008%