Explanations
phrases and discussions related to the impact of media violence on health and behavior
Negative Logits
ynth        -0.55
dibuat      -0.49
쓴          -0.46
Destroyed   -0.45
elected     -0.44
Fired       -0.44
dizer       -0.44
Applied     -0.43
立つ        -0.43
apati       -0.43
Positive Logits
covered     1.15
addressed   0.98
httphttps   0.98
Covered     0.92
covered     0.92
discussed   0.92
Covered     0.90
explored    0.89
dealt       0.84
Gegenstand  0.78
Activations Density 0.708%