INDEX
Explanations
phrases indicating proximity or closeness
Negative Logits
ICAN    -0.86
AIN     -0.74
NZ      -0.73
ERG     -0.65
OOL     -0.64
urers   -0.64
onut    -0.62
uria    -0.61
rain    -0.61
Ò       -0.61
Positive Logits
thereto      1.08
sighted      0.97
proximity    0.95
enough       0.94
minded       0.87
to           0.82
enough       0.80
resemblance  0.76
relatives    0.75
ups          0.72
Activations Density: 0.027%