INDEX
Explanations
official or ordinary classification
Negative Logits
針         -0.98
ஊ         -0.90
Sep       -0.88
لمانيا    -0.85
okka      -0.84
鳥居      -0.83
Outdated  -0.83
فيه       -0.83
somente   -0.83
idea      -0.83
Positive Logits
obligatory  1.02
officially  0.97
official    0.96
mandated    0.96
normal      0.95
官方        0.93
mandatory   0.91
ordinary    0.90
一般        0.89
RICULUM     0.89
Activations Density: 0.130%