INDEX
Explanations
time-related durations, like "years" or "months" spent doing something
time durations and periods associated with experiences or events
Negative Logits
ipper        -0.61
ceilings     -0.61
stood        -0.60
rium         -0.59
awed         -0.59
strength     -0.58
ĭ            -0.56
ippy         -0.56
ogle         -0.55
circle       -0.55
Positive Logits
studying     0.72
izabeth      0.70
researching  0.68
onding       0.68
cation       0.68
wondering    0.67
livest       0.66
photograp    0.65
pher         0.64
sterdam      0.64
Activations Density 0.071%