INDEX
Explanations
references to the word "Rice"
references to the name "Rice."
New Auto-Interp
Negative Logits
merce
-0.73
arios
-0.70
istical
-0.69
raints
-0.69
destruct
-0.67
istically
-0.67
ript
-0.67
umar
-0.66
ulhu
-0.66
imen
-0.66
POSITIVE LOGITS
Rice
0.94
cloth
0.90
boro
0.86
Gear
0.82
Kris
0.81
cooker
0.80
wall
0.74
brook
0.73
Ve
0.72
cook
0.72
Activations Density 0.014%