INDEX
Explanations
phrases emphasizing facts or beliefs
New Auto-Interp
Negative Logits
streng
-0.66
TPPStreamerBot
-0.64
ESE
-0.64
sung
-0.63
itsch
-0.62
wana
-0.61
nan
-0.61
piping
-0.59
bargain
-0.59
lungs
-0.58
POSITIVE LOGITS
ually
1.22
uality
1.18
ional
1.15
orial
1.07
oids
0.90
itious
0.89
ual
0.87
uated
0.86
finding
0.83
oid
0.83
Activations Density 0.528%