INDEX
Explanations
references to specific wards in a local context
Negative Logits
Token        Logit
متعلقه       -0.62
ioe          -0.60
itzende      -0.58
ractable     -0.58
amg          -0.57
例句         -0.56
Royal        -0.55
ोंने         -0.55
liothèque    -0.55
первых       -0.55
Positive Logits
Token        Logit
Ward         1.23
Ward         1.15
ward         1.11
ward         1.07
WARD         1.07
WARD         0.94
wards        0.89
dom          0.74
burned       0.73
cabin        0.66
Activations Density: 0.061%