INDEX
Explanations
the presence of certain key concepts and structures in written text
New Auto-Interp
Negative Logits
iÃŁ
-0.17
ëĦ¤ìĿ´íĬ¸
-0.15
StdString
-0.15
ugen
-0.15
lẫn
-0.15
styleType
-0.15
åĩºçīĪ社
-0.15
.getFont
-0.15
/copyleft
-0.14
ipa
-0.14
POSITIVE LOGITS
0.18
4
0.17
3
0.16
2
0.15
6
0.15
0
0.15
m
0.14
1
0.14
5
0.14
30
0.14
Activations Density 0.020%