INDEX
Explanations
references to the phrase "over the rainbow."
New Auto-Interp
Negative Logits
uro
-0.16
ammad
-0.15
arak
-0.15
resi
-0.15
ing
-0.15
upon
-0.15
окол
-0.14
Tenn
-0.14
our
-0.14
pleasure
-0.14
POSITIVE LOGITS
úb
0.17
borderline
0.16
Fish
0.16
ebra
0.16
border
0.15
Border
0.15
-border
0.15
angan
0.15
edge
0.15
ensburg
0.15
Activations Density 0.043%