INDEX
Explanations
phrases related to normalcy in relationships and the lived experiences of marginalized groups
New Auto-Interp
Negative Logits
kasarigan
-0.76
ectoria
-0.75
-0.67
rungsseite
-0.66
GenerationType
-0.61
UnknownFields
-0.58
Slf
-0.58
logr
-0.52
opsida
-0.51
IonicModule
-0.51
POSITIVE LOGITS
normal
1.06
normal
0.96
Normal
0.93
normaux
0.87
Normal
0.87
ordinary
0.87
normaal
0.87
normale
0.87
conventional
0.86
普通の
0.85
Activations Density 0.324%