INDEX
Explanations
phrases related to current events and news updates
references to the "Now" media content
New Auto-Interp
Negative Logits
Berm
-0.62
similarity
-0.60
resemb
-0.60
Parsons
-0.60
fault
-0.59
similarities
-0.58
é¾
-0.58
disob
-0.57
elim
-0.57
Parables
-0.56
POSITIVE LOGITS
adays
1.53
here
1.27
heres
1.08
ledge
0.85
ledged
0.85
Playing
0.83
Launcher
0.83
!,
0.78
lies
0.75
heric
0.74
Activations Density 0.027%