INDEX
Explanations
phrases related to beliefs or assumptions
occurrences of the word "to" indicating intention or belief
New Auto-Interp
Negative Logits
river
-0.77
noticed
-0.66
bats
-0.65
case
-0.64
Perspective
-0.64
viewer
-0.61
Rapids
-0.61
Spaces
-0.60
finder
-0.60
iott
-0.60
POSITIVE LOGITS
be
1.04
embody
0.96
originate
0.92
indicate
0.91
derive
0.89
resemble
0.89
belong
0.89
signify
0.87
have
0.87
imply
0.86
Activations Density 0.110%