INDEX
Explanations
references to the TV show "Orange is the New Black."
occurrences of the word "Orange."
New Auto-Interp
Negative Logits
Ö¼
-0.98
sonian
-0.86
stood
-0.78
adr
-0.78
enance
-0.78
rets
-0.73
ngth
-0.73
akable
-0.72
arnaev
-0.71
ted
-0.70
POSITIVE LOGITS
vale
1.09
peel
1.05
Blossom
1.05
juice
1.00
Peel
0.97
Juice
0.94
issance
0.89
Crush
0.83
cones
0.79
berry
0.78
Activations Density 0.018%