INDEX
Explanations
mentions of areas and locations in close proximity
New Auto-Interp
Negative Logits
ulang
-0.52
Obispo
-0.51
idéia
-0.51
Crete
-0.51
aimer
-0.50
أعلام
-0.49
Screen
-0.48
setopt
-0.48
retan
-0.48
arbejde
-0.47
POSITIVE LOGITS
neighbors
1.11
neighbor
1.10
neighbours
1.07
neighbour
1.06
Neighbor
1.00
neigh
1.00
immédi
0.99
NEIGH
0.98
Neigh
0.95
neighbour
0.90
Activations Density 0.115%