INDEX
Explanations
phrases related to decision-making and assessment of cultural identity
Negative Logits
ویکیپدیا	-0.65
MimeType	-0.64
florales	-0.63
itſelf	-0.57
Normdatei	-0.57
pins	-0.57
strains	-0.56
gears	-0.55
Menschheit	-0.55
üğ	-0.55
Positive Logits
themselves	1.78
themselves	1.51
Their	1.38
Their	1.34
their	1.33
THEIR	1.17
their	1.13
they	1.02
They	0.95
selves	0.94
Activations Density: 0.384%
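
The density figure is commonly read as the share of sampled tokens on which the feature fires at all. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly above zero, and the function name `activation_density` and the toy data are hypothetical, not taken from the dashboard that produced this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of activations strictly above `threshold`.

    Assumption: a feature "fires" on a token when its activation is
    greater than the threshold (zero by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical toy sample: the feature fires on 3 of 1000 tokens.
sample = [0.0] * 997 + [0.5, 1.2, 0.1]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.384% would correspond to the feature firing on roughly 4 of every 1,000 sampled tokens.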