INDEX
Explanations
mentions of tar sands and related terms
mentions of tar sands and related environments
New Auto-Interp
Negative Logits
fecture
-0.72
tymology
-0.72
redit
-0.68
redundancy
-0.67
yright
-0.64
ntil
-0.64
ernandez
-0.63
rities
-0.63
uilt
-0.63
Prob
-0.62
POSITIVE LOGITS
sands
1.35
Sands
1.03
ĸļ
0.83
boro
0.81
gow
0.79
wagon
0.77
cape
0.76
heet
0.76
Wast
0.73
wastes
0.73
Activations Density 0.010%