INDEX
Explanations
references to celebrity situations and support for personal struggles
New Auto-Interp
Negative Logits
gaard
-0.17
nat
-0.15
allax
-0.15
Plate
-0.15
plate
-0.15
ableOpacity
-0.14
acin
-0.14
plates
-0.14
tsky
-0.14
Deque
-0.14
POSITIVE LOGITS
Spears
0.38
Brit
0.30
conserv
0.29
Spe
0.29
spe
0.28
Brit
0.26
Conserv
0.26
Spear
0.23
Circus
0.22
-spe
0.21
Activations Density 0.004%