INDEX
Negative Logits
ä¸įæĸŃå®ĮåĸĦ
-0.29
hum
-0.28
PropertyName
-0.26
incer
-0.26
pieces
-0.25
ihnen
-0.25
strat
-0.24
å¼ĢèįĴ
-0.24
ACHI
-0.24
reich
-0.23
POSITIVE LOGITS
Intermediate
0.27
odel
0.27
골
0.26
ogene
0.25
0.25
ìĤ¬
0.25
å·²ç»ıæĺ¯
0.25
olle
0.24
cá»Ń
0.24
åĸĿ
0.24
Activations Density 0.391%