INDEX
Explanations
the act of expressing thoughts and feelings verbally, particularly emphasizing the importance of communication
New Auto-Interp
Negative Logits
sey
-0.17
h
-0.17
olin
-0.16
ust
-0.16
ir
-0.15
avin
-0.15
cont
-0.15
itched
-0.15
avo
-0.15
rit
-0.13
POSITIVE LOGITS
about
0.20
_about
0.19
ABOUT
0.17
ạch
0.17
itecture
0.16
-about
0.16
ounge
0.16
tentang
0.16
About
0.16
About
0.15
Activations Density 0.186%