INDEX
Explanations
multi-word expressions that connote struggle or resilience
New Auto-Interp
Negative Logits
furt
-0.93
ivas
-0.90
iov
-0.83
vier
-0.83
jan
-0.81
orate
-0.81
ça
-0.80
intend
-0.80
izont
-0.80
INT
-0.79
POSITIVE LOGITS
alike
1.02
dictated
0.72
stamped
0.67
engraved
0.66
timeless
0.66
incarn
0.66
depending
0.65
etched
0.65
embodied
0.65
striped
0.64
Activations Density 0.085%