INDEX
Explanations
references to feeding, nourishment, or food-related actions and concepts
New Auto-Interp
Negative Logits
áze
-0.17
pic
-0.16
ive
-0.15
neod
-0.14
Crash
-0.14
psc
-0.14
bent
-0.14
e
-0.14
adb
-0.14
Rust
-0.13
POSITIVE LOGITS
/feed
0.25
.feed
0.25
-feed
0.24
.Feed
0.22
fed
0.20
(feed
0.20
stocks
0.19
Feed
0.19
fed
0.18
feed
0.18
Activations Density 0.021%