INDEX
Explanations
mentions of locations and addresses
New Auto-Interp
Negative Logits
059
-0.20
064
-0.18
058
-0.18
057
-0.18
098
-0.18
066
-0.17
coz
-0.16
078
-0.16
067
-0.16
056
-0.15
POSITIVE LOGITS
located
0.23
111
0.21
corner
0.20
160
0.20
located
0.19
101
0.19
corner
0.19
115
0.19
0.19
145
0.19
Activations Density 0.118%