INDEX
Explanations
statements or contexts related to regulations and their implications
Negative Logits
هر          -0.15
skyt        -0.15
VRT         -0.14
olursa      -0.14
prav        -0.14
надлеж      -0.14
DBObject    -0.13
/*\r\n      -0.13
conde       -0.13
rowspan     -0.13
Positive Logits
one         0.72
之一        0.59
among       0.59
amongst     0.50
among       0.48
یکی         0.48
Among       0.46
其中        0.46
salah       0.44
の一        0.44
Activations Density 0.179%