Explanations
thoughts and reflections on identity and societal perceptions
Negative Logits
TSR             -0.07
uw              -0.07
orias           -0.07
/GPL            -0.06
valuator        -0.06
queryInterface  -0.06
iversite        -0.06
rlen            -0.06
overt           -0.06
塚              -0.06
Positive Logits
nor             0.15
Nor             0.12
Nope            0.10
sondern         0.10
Nor             0.10
nor             0.10
sino            0.09
む              0.08
بلکه            0.07
那样            0.07
Activations Density: 0.024%