INDEX
Explanations
phrases that indicate a specific location or context
references to timeframes and contextual phrases
New Auto-Interp
Negative Logits
arcity
-0.73
viol
-0.65
pian
-0.64
ocre
-0.62
daq
-0.61
torches
-0.60
Luigi
-0.59
culosis
-0.59
gery
-0.58
piece
-0.58
POSITIVE LOGITS
confines
1.58
bounds
1.27
boundaries
0.97
limits
0.93
borders
0.91
radius
0.89
parameters
0.88
ombat
0.86
realms
0.84
circles
0.84
Activations Density 0.121%