INDEX
Explanations
keywords or key phrases
key phrases or terms that indicate important concepts or takeaways
New Auto-Interp
Negative Logits
amused
-0.64
bene
-0.63
Whedon
-0.63
forgiveness
-0.62
shrug
-0.62
Bett
-0.61
Vulcan
-0.61
forgiving
-0.61
civ
-0.61
aunt
-0.60
POSITIVE LOGITS
Key
3.81
key
2.48
KEY
2.46
Keys
2.37
Key
2.06
keys
1.85
KEY
1.79
key
1.73
Keys
1.55
keys
1.55
Activations Density 0.011%