INDEX
Explanations
mentions of a specific location or city name, particularly "Guwahati"
New Auto-Interp
Negative Logits
stood
-0.74
Icar
-0.73
where
-0.71
shows
-0.67
Faster
-0.65
bearing
-0.65
spin
-0.64
Dust
-0.64
Patient
-0.63
margins
-0.63
POSITIVE LOGITS
ati
1.08
oga
1.02
apolis
1.02
uga
0.98
ibo
0.96
udi
0.95
ya
0.94
hedral
0.94
eta
0.92
ual
0.92
Activations Density 0.012%