INDEX
Explanations
phrases related to urban environments and locations
references to biking or transportation methods
New Auto-Interp
Negative Logits
)."
-0.93
),"
-0.87
.")
-0.86
]."
-0.83
").
-0.82
"—
-0.82
)—
-0.78
"),
-0.75
ÂŃ
-0.73
],"
-0.69
POSITIVE LOGITS
âĵĺ
0.87
commented
0.80
discusses
0.75
welcomes
0.68
etheless
0.68
agrees
0.67
welcomed
0.66
echoed
0.65
eatures
0.65
senal
0.64
Activations Density 1.097%