INDEX
Explanations
references to various types of beans
references to beans
New Auto-Interp
Negative Logits
NR
-0.80
TN
-0.75
Spectre
-0.73
phys
-0.73
FN
-0.70
DV
-0.69
ulation
-0.69
TN
-0.68
vi
-0.68
Spect
-0.67
POSITIVE LOGITS
beans
3.94
Beans
3.37
bean
3.23
Bean
2.75
beans
2.69
bean
2.17
peas
1.57
pods
1.36
roast
1.27
tomatoes
1.26
Activations Density 0.030%