INDEX
Explanations
references to ordinary human experiences and characteristics
New Auto-Interp
Negative Logits
مصادر
-0.48
ytä
-0.47
Goed
-0.45
StrictEqual
-0.45
nakalista
-0.45
taines
-0.45
晦
-0.44
奉
-0.43
loc
-0.43
*
-0.42
POSITIVE LOGITS
humanas
0.89
ordinary
0.85
humaine
0.78
humain
0.78
AssemblyCulture
0.77
ordinary
0.76
GEBURTSDATUM
0.76
humano
0.74
istoitu
0.74
relatable
0.73
Activations Density 0.259%