INDEX
Explanations
phrases that introduce a shift in focus or provide context for a subsequent statement
references to illumination or clarity in the context of discussions or events
New Auto-Interp
Negative Logits
lycer
-0.74
irie
-0.74
irlf
-0.72
pload
-0.70
itals
-0.68
sis
-0.68
asus
-0.68
hesive
-0.66
kefeller
-0.65
Carnegie
-0.65
POSITIVE LOGITS
enment
1.02
ening
0.89
circumstances
0.85
questioning
0.80
ened
0.78
weights
0.76
weight
0.73
circumstance
0.73
bringer
0.72
lessness
0.68
Activations Density 0.016%