Explanations
phrases that indicate a specific location or context
mentions of specific locations or contexts in which events or actions occur
Negative Logits
Token       Logit
onna        -0.72
ulence      -0.67
Uriel       -0.64
rex         -0.63
gery        -0.63
agascar     -0.59
ammy        -0.59
arcity      -0.58
apest       -0.58
Taxes       -0.58
Positive Logits
Token       Logit
confines    1.72
bounds      1.46
boundaries  1.24
limits      1.09
borders     1.02
scope       0.99
walls       0.97
parameters  0.95
radius      0.95
realm       0.92
Activations Density: 0.163%
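The page does not define how the density figure is computed. A minimal sketch, assuming it is the fraction of sampled tokens on which the feature's activation is above zero; the function name `activation_density` and the `threshold` parameter are hypothetical and used only for illustration:

```python
from typing import Sequence


def activation_density(activations: Sequence[float], threshold: float = 0.0) -> float:
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature fires at all. This definition is not confirmed by the page.
    """
    if len(activations) == 0:
        raise ValueError("activations must be non-empty")
    # Count tokens where the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Toy data: the feature fires on 1 of 4 tokens.
toy = [0.0, 0.0, 1.2, 0.0]
print(f"{activation_density(toy):.3%}")
```

Under this reading, a density of 0.163% would correspond to the feature firing on roughly 1.6 of every 1,000 sampled tokens.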