INDEX
Explanations
transcripts
references to transcripts
Negative Logits
vil                 -0.72
ggle                -0.71
yu                  -0.70
³³³³³³³³³³³³³³³³    -0.66
zzle                -0.65
deen                -0.64
pper                -0.63
Bengal              -0.62
dream               -0.62
Fury                -0.62
Positive Logits
Transcript          1.26
transcripts         1.26
transcript          1.21
transcription       1.00
icons               0.84
snippets            0.82
ions                0.79
ophone              0.78
ophon               0.78
excerpts            0.76
Activations Density: 0.013%