INDEX
Explanations
references to specific locations, particularly states and fictional towns
NEGATIVE LOGITS
ëĦIJ            -0.17
uls            -0.16
Calgary        -0.15
Alberta        -0.15
Edmonton       -0.15
Saskatchewan   -0.15
993            -0.15
actus          -0.15
нен            -0.14
ipzig          -0.14
POSITIVE LOGITS
Rhode          0.57
RI             0.47
Providence     0.43
Narr           0.42
Rh             0.40
RI             0.38
Narr           0.36
Warwick        0.36
Newport        0.36
RIPT           0.36
Activations Density: 0.029%