INDEX
Explanations
references to locations or places
context of a place
New Auto-Interp
Negative Logits
Uber
-0.48
tegen
-0.47
nonchal
-0.44
disgruntled
-0.42
ueger
-0.41
autogui
-0.41
utilising
-0.41
cnico
-0.41
cchi
-0.40
Uber
-0.40
POSITIVE LOGITS
place
1.00
places
0.94
place
0.88
PLACE
0.88
Place
0.86
Places
0.86
Places
0.84
Place
0.82
places
0.81
PLACES
0.78
Activations Density 0.012%