INDEX
Explanations
references to a specific geographic feature - a river
references to rivers
Negative Logits
Reloaded    -0.75
Mach        -0.68
erity       -0.67
MFT         -0.62
ACH         -0.62
Horowitz    -0.61
iaries      -0.61
ernel       -0.61
xus         -0.60
Pie         -0.59
Positive Logits
bank        1.03
front       1.02
banks       1.01
basin       0.94
river       0.93
valley      0.92
ine         0.88
rivers      0.87
valleys     0.85
canyon      0.85
Activations Density: 0.028%