INDEX
Explanations
intensifiers or adjectives expressing strong emphasis
New Auto-Interp
Negative Logits
ooth
-0.15
ipy
-0.15
chatt
-0.15
iets
-0.15
iversit
-0.14
>{@-0.14
VERAGE
-0.14
ORITY
-0.14
oor
-0.13
IGINAL
-0.13
POSITIVE LOGITS
lei
0.18
çek
0.15
thr
0.14
/*================================================================
0.14
ÑĪка
0.14
riel
0.14
-scalable
0.13
нÑı
0.13
ccione
0.13
ici
0.13
Activations Density 0.066%