INDEX
Explanations
adjectives or phrases with strong emotional connotations
single occurrences of various letters or symbols in the text
New Auto-Interp
Negative Logits
interacts
-0.66
Chao
-0.64
querque
-0.62
targ
-0.62
KS
-0.61
bake
-0.61
ãĤ¼ãĤ¦ãĤ¹
-0.60
enhagen
-0.60
Decker
-0.59
Dres
-0.59
POSITIVE LOGITS
seless
1.03
icious
1.02
bidden
1.02
ificantly
0.99
ateful
0.99
iscal
0.97
initely
0.97
usterity
0.95
actory
0.95
thora
0.94
Activations Density 0.210%