Explanations
fruit names
references to various types of fruits
Negative Logits
hematic      -0.82
RAW          -0.80
condem       -0.80
kson         -0.76
ciplinary    -0.73
raq          -0.71
rict         -0.70
iazep        -0.69
earable      -0.68
gregation    -0.68
Positive Logits
fruit           1.43
fruit           1.20
juice           1.15
fruits          1.13
strawberries    1.02
cherry          1.02
tomatoes        0.98
mango           0.97
strawberry      0.96
juices          0.95
Activations Density: 0.031%