INDEX
Explanations
phrases related to procedural or administrative documentation
New Auto-Interp
Negative Logits
Ñĸ
-0.17
conf
-0.15
ÑĶ
-0.15
mat
-0.15
quint
-0.15
class
-0.15
ident
-0.14
tall
-0.14
chn
-0.14
bart
-0.14
POSITIVE LOGITS
ÙģÙĬ
0.22
ÙĪ
0.22
Ùĥ
0.20
ÙĦ
0.20
ÂłÙħ
0.20
ÙħÙĨ
0.19
بر
0.19
عÙĦÙī
0.18
ØĮ
0.18
با
0.18
Activations Density 0.021%