INDEX
Explanations
mentions of the word "Fred"
occurrences of the name "Fred."
New Auto-Interp
Negative Logits
BOOK
-0.82
Demand
-0.74
Drug
-0.71
OPLE
-0.68
ffiti
-0.64
Depth
-0.63
forth
-0.63
vain
-0.61
Draft
-0.61
calming
-0.61
POSITIVE LOGITS
rik
1.07
rique
1.06
rick
0.98
dy
0.94
dies
0.91
ric
0.90
ricks
0.90
riks
0.88
Fred
0.87
erick
0.86
Activations Density 0.008%