INDEX
Explanations
quotations
occurrences of quotation marks
New Auto-Interp
Negative Logits
glac
-0.91
periodic
-0.90
brid
-0.89
stocking
-0.87
pudding
-0.87
confir
-0.86
accomp
-0.84
grip
-0.83
Þ
-0.82
deton
-0.82
POSITIVE LOGITS
They
1.85
We
1.78
It
1.78
Especially
1.77
That
1.76
But
1.76
Nobody
1.75
And
1.74
Because
1.74
Everybody
1.74
Activations Density 0.094%