INDEX
Explanations
proper nouns
occurrences of the word "dedicate" and its variations
Negative Logits
Token               Logit
Pastebin            -0.70
Manip               -0.67
Surge               -0.65
Recession           -0.62
=-=-=-=-=-=-=-=-    -0.62
¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯¯    -0.62
Terr                -0.62
Verge               -0.62
Train               -0.61
Weaver              -0.60
Positive Logits
Token               Logit
ications            1.29
ded                 1.25
uced                1.21
ication             1.10
icates              1.07
icated              1.03
icating             1.03
ucing               0.99
ifferent            0.94
iving               0.93
Activations Density 0.009%