INDEX
Explanations
references to specific deities and religious figures
Mythological figures, religious concepts, and diverse noun references
New Auto-Interp
Negative Logits
extAlignment
-0.62
URLException
-0.62
purpoſe
-0.59
Aton
-0.57
calaureate
-0.56
pantalones
-0.52
ſind
-0.52
CppCodeGen
-0.52
zzleHttp
-0.51
Broth
-0.51
POSITIVE LOGITS
Aholisi
0.38
Popov
0.34
autorytatywna
0.33
Dış
0.31
Witam
0.30
muñeca
0.30
hôtel
0.30
เส
0.30
Viited
0.29
continúas
0.29
Activations Density 0.025%