INDEX
Explanations
references to walls and their characteristics
New Auto-Interp
Negative Logits
Judson
-0.80
temprana
-0.80
Plural
-0.76
foncé
-0.76
ujednoznacz
-0.76
wezen
-0.74
noires
-0.73
pleaſure
-0.73
inoxydable
-0.73
epam
-0.73
POSITIVE LOGITS
wall
2.00
WALL
1.91
Wall
1.89
walls
1.81
Wall
1.76
wall
1.72
Walls
1.66
WALL
1.61
Walls
1.53
walls
1.53
Activations Density 0.054%