Explanations
references to organizations or divisions within a bureaucratic context
Negative Logits
autorytatywna      -0.72
                   -0.71
tagHelperRunner    -0.69
ագրություններ      -0.63
CloseOperation     -0.61
BorderSide         -0.60
Monfieur           -0.60
Anſ                -0.60
Попис              -0.59
دانشنامهٔ           -0.59
Positive Logits
\#     0.63
#      0.61
$\#    0.60
Ⅱ      0.59
III    0.56
Ⅸ      0.56
Ⅲ      0.56
(“     0.56
Ⅵ      0.55
Ⅴ      0.55
Activations Density: 0.307%