INDEX
Explanations
mentions of the name "Roger"
mentions of the name "Roger"
New Auto-Interp
Negative Logits
ipeg
-0.80
runtime
-0.76
pron
-0.71
spir
-0.71
erala
-0.70
pai
-0.69
amaz
-0.67
borne
-0.67
ihar
-0.67
âĹ¼
-0.66
POSITIVE LOGITS
Roger
1.07
Roger
0.95
otiation
0.84
Goodell
0.84
Waters
0.80
rique
0.78
Rod
0.78
rers
0.75
Zel
0.75
Wim
0.73
Activations Density 0.006%