INDEX
Explanations
high-impact words or phrases that evoke strong emotions or reactions
capital letters or proper nouns
New Auto-Interp
Negative Logits
EStream
-0.75
Hobbit
-0.73
Wast
-0.69
Oath
-0.69
pony
-0.68
Alchemist
-0.68
foundland
-0.66
LOD
-0.62
Ħ¢
-0.61
hyde
-0.60
POSITIVE LOGITS
cially
1.10
itting
1.07
rarily
1.06
ually
1.06
itionally
1.04
ifully
1.03
cknowled
1.02
ately
1.00
aring
0.99
quartered
0.99
Activations Density 0.236%