INDEX
Explanations
statements or quotations made by people
New Auto-Interp
Negative Logits
estern
-0.95
ammy
-0.71
avorite
-0.71
peg
-0.70
OUP
-0.67
transfer
-0.66
esc
-0.66
ãĤ¼ãĤ¦ãĤ¹
-0.65
ynam
-0.65
asonic
-0.65
POSITIVE LOGITS
"[
0.87
they
0.85
it
0.84
"...
0.77
"(
0.75
"'
0.74
"â̦
0.73
goodbye
0.72
instead
0.70
otherwise
0.70
Activations Density 0.078%