INDEX
Explanations
personal narratives containing details about decisions and experiences
Negative Logits
WATCH        -0.76
ebin         -0.68
HERE         -0.65
claimer      -0.64
osponsors    -0.62
Appears      -0.62
Update       -0.61
currently    -0.59
ogene        -0.59
zbollah      -0.59
Positive Logits
mattered     0.94
lacked       0.88
tended       0.87
resembled    0.80
seemed       0.79
weren        0.77
consisted    0.77
knew         0.77
didn         0.76
outnumbered  0.76
Activations Density: 7.068%