INDEX
Explanations
intensifiers and adverbs that emphasize degree or extent
New Auto-Interp
Negative Logits
Lit
-0.16
Lit
-0.15
enuous
-0.15
battery
-0.14
arme
-0.14
rees
-0.14
Aliases
-0.14
ury
-0.14
reet
-0.13
UNET
-0.13
POSITIVE LOGITS
ynchronously
0.22
differently
0.21
astically
0.19
grily
0.18
ulously
0.17
ensively
0.17
наÑĩе
0.16
clus
0.16
/by
0.16
ensibly
0.16
Activations Density 0.108%