INDEX
Explanations
references to individuals and their roles or actions within specific contexts
Third-person pronouns followed by verbs
pronoun + verb
New Auto-Interp
Negative Logits
Doing
-0.60
oughlin
-0.55
doing
-0.55
Done
-0.55
Doing
-0.55
gway
-0.54
expandindo
-0.51
done
-0.50
Done
-0.49
Mucha
-0.47
POSITIVE LOGITS
ComVisible
0.72
berdayakan
0.65
ArrowToggle
0.63
ReusableCell
0.60
belong
0.60
RegressionTest
0.58
ptest
0.56
dalamnya
0.56
aDecoder
0.54
belongs
0.53
Activations Density 0.177%