INDEX
Explanations
instances of someone attempting to communicate unsuccessfully and then trying again
phrases indicating missed opportunities or unanswered communication
Negative Logits
Token        Logit
ãĥª          -0.66
Ü            -0.62
tains        -0.61
umbn         -0.61
ranch        -0.58
ãĥĪ          -0.57
Licensed     -0.57
Auth         -0.57
ãĥĨ          -0.56
Represent    -0.55
Positive Logits
Token        Logit
till         1.04
anyway       1.00
until        0.97
anyways      0.97
downstairs   0.94
but          0.93
though       0.93
upstairs     0.93
afterwards   0.89
tho          0.88
Activations Density: 0.581%
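
The density figure is usually read as the fraction of sampled token positions on which the feature fires. The sketch below illustrates that reading only. It assumes "fires" means an activation strictly above zero, and the function name `activation_density`, the `threshold` parameter, and the toy data are all hypothetical, not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: a position counts as active when its activation is strictly
    greater than `threshold` (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 6 of them with nonzero activation.
toy = [0.0] * 994 + [0.3, 1.2, 0.8, 2.1, 0.5, 0.1]
density = activation_density(toy)
print(f"{density:.3%}")
```

Under this reading, a density of 0.581% would mean the feature is active on roughly 6 out of every 1000 sampled tokens.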