INDEX
Explanations
references to a specific location, Vermont
references to the state of Vermont
New Auto-Interp
Negative Logits
omaly
-0.87
visual
-0.83
earance
-0.81
ciating
-0.79
ept
-0.79
ospace
-0.78
odynam
-0.77
otropic
-0.77
vantage
-0.77
ppo
-0.77
POSITIVE LOGITS
Burlington
1.00
Yankee
0.99
Vermont
0.90
Leaks
0.87
Sanders
0.86
Eag
0.78
ucky
0.77
Coat
0.75
Falls
0.75
Yard
0.75
Activations Density 0.012%