INDEX
Explanations
mentions of a specific group of people or entities
references to groups or audiences
New Auto-Interp
Negative Logits
Director
-0.76
Dialog
-0.71
orial
-0.69
Plot
-0.67
forth
-0.67
Dust
-0.66
enegger
-0.66
grass
-0.64
Dep
-0.63
ãģ®å
-0.62
POSITIVE LOGITS
sake
0.98
wishing
0.89
guessed
0.80
wanting
0.78
purposes
0.77
ufact
0.75
attending
0.74
unfamiliar
0.73
redes
0.73
interested
0.72
Activations Density 0.086%