INDEX
Explanations
places or structures
references to social issues and conditions affecting people
Negative Logits
ebin         -0.73
Operation    -0.69
ensis        -0.62
atorium      -0.62
enegger      -0.61
ILE          -0.60
Nightmares   -0.59
Solution     -0.58
Revival      -0.58
nce          -0.58
Positive Logits
abound       1.08
mith         0.97
gal          0.93
pread        0.92
peed         0.92
hips         0.87
ranging      0.86
pots         0.86
ome          0.84
drawn        0.82
Activations Density 0.488%