Explanations
phrases related to unfolding situations or processes
phrases related to events or situations unfolding over time
Negative Logits
Token      Logit
asus       -0.87
cius       -0.77
antry      -0.71
udi        -0.68
shaw       -0.68
avorite    -0.67
erva       -0.65
resents    -0.64
oppy       -0.64
aints      -0.64
Positive Logits
Token        Logit
fitted       0.95
stretched    0.87
differently  0.85
wards        0.85
casts        0.80
loud         0.79
posts        0.76
exactly      0.72
matched      0.71
flows        0.71
Activations Density 0.060%