INDEX
Explanations
mentions of geographical locations
references to regions in India, particularly those related to Bengal
New Auto-Interp
Negative Logits
Ö¼
-0.95
mble
-0.88
ettings
-0.74
orter
-0.72
ipple
-0.69
yll
-0.67
ynt
-0.66
utherford
-0.66
Kessler
-0.63
inition
-0.63
POSITIVE LOGITS
uru
1.16
Bengal
1.11
tigers
0.92
Nadu
0.91
icum
0.87
ataka
0.87
wings
0.86
tiger
0.83
Pradesh
0.82
²¾
0.81
Activations Density 0.005%