INDEX
Explanations
instances where someone has experience or knowledge in a certain subject
references to individuals' experiences or identities
Negative Logits
Token          Logit
Subst          -0.76
interstitial   -0.69
auga           -0.65
Surprise       -0.63
Prompt         -0.63
Splash         -0.62
scorer         -0.62
Reloaded       -0.61
prompts        -0.60
guiName        -0.60
Positive Logits
Token          Logit
studied        1.18
watched        1.11
interacted     1.04
frequ          1.03
worked         1.02
lived          1.02
attended       1.01
listened       0.99
convers        0.95
witnessed      0.95
Activations Density: 0.249%