INDEX
Explanations
references to railway companies and related terminology
New Auto-Interp
Negative Logits
carriage
-0.19
railway
-0.15
ickets
-0.15
алеж
-0.14
darm
-0.14
avig
-0.14
metro
-0.14
Salah
-0.14
safezone
-0.14
ker
-0.14
POSITIVE LOGITS
GP
0.21
repaint
0.20
roster
0.19
Shops
0.18
UP
0.18
Clin
0.17
Chess
0.17
reef
0.17
SP
0.17
cabo
0.17
Activations Density 0.006%