INDEX
Explanations
phrases related to written communication
markers of sentence structure or formatting cues
New Auto-Interp
Negative Logits
frontline
-0.55
aky
-0.53
ussy
-0.52
emaker
-0.51
proble
-0.51
ÂŃ
-0.49
coni
-0.49
chuk
-0.49
afer
-0.48
ady
-0.47
POSITIVE LOGITS
huh
0.85
please
0.82
please
0.82
etc
0.80
govtrack
0.79
channelAvailability
0.70
whence
0.66
rete
0.65
wherein
0.64
yeah
0.63
Activations Density 0.235%