INDEX
Explanations
mentions of the word "turkey."
references to turkey
Negative Logits
ardo            -0.79
iott            -0.75
iam             -0.74
ingly           -0.72
largeDownload   -0.70
ister           -0.69
ances           -0.68
orius           -0.68
esis            -0.66
inel            -0.66
Positive Logits
geon            0.80
geons           0.80
gie             0.75
STEM            0.74
Rex             0.72
Sabres          0.71
gery            0.69
BIL             0.69
Nug             0.65
pora            0.64
Activations Density 0.029%
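
As a rough illustration of what a density figure like this usually means, the sketch below assumes that activation density is the fraction of token positions on which the feature fires (activation above zero). That definition, the function name `activation_density`, and the sample data are all assumptions made for illustration; they are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition: density = (# positions where the feature fires) / (# positions).
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 10,000 token positions, of which 3 have a nonzero activation.
sample = [0.0] * 9997 + [1.2, 0.4, 2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.029% would correspond to the feature firing on roughly 29 of every 100,000 token positions.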