INDEX
Explanations
questions or statements expressing debate or inquiry
key nouns and their relationships in complex statements
New Auto-Interp
Negative Logits
comfort
-0.81
enegger
-0.76
Pic
-0.72
Vers
-0.71
ufact
-0.69
opens
-0.69
Wil
-0.69
ickr
-0.66
laughs
-0.66
Font
-0.64
POSITIVE LOGITS
differ
0.78
shine
0.75
stray
0.67
exist
0.66
surrog
0.66
represent
0.64
mean
0.62
exist
0.62
resonate
0.60
suffice
0.60
Activations Density 0.246%