INDEX
Explanations
adjectives describing high intensity or extreme satisfaction
expressions conveying a strong degree of emphasis or intensity
New Auto-Interp
Negative Logits
ership
-0.65
Newsletter
-0.64
excerpts
-0.64
Flavoring
-0.63
ulia
-0.61
holder
-0.60
works
-0.59
Citizens
-0.59
Altern
-0.59
Afee
-0.59
POSITIVE LOGITS
ooo
1.49
oooo
1.48
oooooooo
1.29
bered
1.16
darn
1.16
oo
1.12
damn
1.08
apy
1.07
oths
1.06
goddamn
1.05
Activations Density 0.049%