INDEX
Explanations
terms related to qualifications and conditions for actions or events
New Auto-Interp
Negative Logits
Con
-0.28
Con
-0.17
-Con
-0.17
conjug
-0.16
šit
-0.15
Thu
-0.15
chu
-0.15
Cone
-0.15
ufs
-0.14
ube
-0.14
POSITIVE LOGITS
icon
0.20
on
0.19
eon
0.19
cons
0.18
icons
0.18
-k
0.18
کاÙĨ
0.18
oon
0.18
-cons
0.17
建è¨Ń
0.17
Activations Density 0.062%