INDEX
Explanations
references to enclosed spaces or barriers
New Auto-Interp
Negative Logits
enthal
-0.84
ãĥ£
-0.74
nant
-0.74
76561
-0.71
ãĤ§
-0.70
rouse
-0.68
zinski
-0.67
ku
-0.66
ruary
-0.65
enhagen
-0.65
POSITIVE LOGITS
osures
1.33
encl
0.98
izabeth
0.87
enclosed
0.83
enclosure
0.83
spaces
0.76
ridges
0.73
circuits
0.72
ously
0.72
aves
0.71
Activations Density 0.006%