INDEX
Explanations
video-related keywords and phrases
content related to video titles or video watch prompts
New Auto-Interp
Negative Logits
kowski
-0.62
schild
-0.61
heit
-0.61
adem
-0.57
wagon
-0.57
umenthal
-0.53
anka
-0.52
Sparrow
-0.52
lain
-0.51
acea
-0.51
POSITIVE LOGITS
MUST
1.38
must
0.78
Must
0.75
Must
0.71
WATCH
0.66
SHOULD
0.63
must
0.63
Became
0.62
Spawn
0.61
MAY
0.60
Activations Density 0.026%