INDEX
Explanations
specific geographical or contextual references related to Pakistan and related political entities
start of turn user
New Auto-Interp
Negative Logits
PhysRevD
-0.59
centre
-0.54
accor
-0.52
Centre
-0.49
Bif
-0.49
RDA
-0.49
Memor
-0.48
BytesLike
-0.48
acc
-0.48
Orders
-0.48
POSITIVE LOGITS
Shiv
0.99
Shiv
0.95
Imran
0.71
sonaro
0.56
Bolsonaro
0.55
lamabad
0.54
Karachi
0.52
mijne
0.51
TextWatcher
0.49
voks
0.49
Activations Density 0.001%