INDEX
Explanations
references to different types of vans or vehicles
vehicles like vans and trucks
New Auto-Interp
Negative Logits
__':
-0.47
...',
-0.44
...".
-0.43
ouro
-0.43
intracellular
-0.43
".
-0.42
$.
-0.42
......”
-0.41
Argentine
-0.41
..."
-0.41
POSITIVE LOGITS
vans
0.90
furg
0.81
TRUCK
0.71
camion
0.68
Trucks
0.68
truck
0.67
camión
0.66
trucks
0.66
camiones
0.66
minivan
0.64
Activations Density 0.006%