INDEX
Explanations
mentions of walls
references to barriers or obstructions, particularly walls
New Auto-Interp
Negative Logits
judicial
-0.66
predictive
-0.60
convenient
-0.58
Coch
-0.58
Gene
-0.56
heny
-0.55
pivotal
-0.55
ELS
-0.54
skilled
-0.54
practiced
-0.54
POSITIVE LOGITS
abies
1.17
wall
1.14
papers
1.00
paper
0.96
aby
0.94
stones
0.91
igans
0.81
odon
0.80
agos
0.79
wallpaper
0.79
Activations Density 0.005%