INDEX
Explanations
references to locations and where individuals live
Negative Logits
Downtown      -0.15
City          -0.15
uj            -0.15
urai          -0.15
City          -0.14
_tolerance    -0.14
assets        -0.14
iras          -0.13
akan          -0.13
1             -0.13
Positive Logits
England       0.19
England       0.17
Britain       0.16
zee           0.15
Germany       0.15
Chapel        0.15
British       0.15
France        0.15
ataires       0.15
ANGLES        0.15
Activations Density: 0.198%