INDEX
Explanations
phrases indicating customization or tailoring to specific needs
New Auto-Interp
Negative Logits
YG
-0.16
dikke
-0.16
SDL
-0.15
emoc
-0.15
Zem
-0.14
spo
-0.14
Cyrus
-0.14
Äįem
-0.14
Pam
-0.14
rika
-0.13
POSITIVE LOGITS
elsen
0.16
etter
0.15
581
0.15
Karlov
0.14
LinkId
0.14
mach
0.14
otta
0.14
fab
0.14
Barnett
0.14
-Ray
0.13
Activations Density 0.026%