INDEX
Explanations
phrases related to official statements or sources
New Auto-Interp
Negative Logits
charms
-0.72
gra
-0.67
;;;;;;;;;;;;
-0.66
Thumbnail
-0.66
Cure
-0.65
Rape
-0.64
ç¥ŀ
-0.63
profits
-0.63
Babe
-0.63
grass
-0.62
POSITIVE LOGITS
unnamed
1.12
source
1.01
Another
0.95
sources
0.92
unidentified
0.91
Sources
0.90
insider
0.86
Another
0.82
another
0.81
personnel
0.80
Activations Density 0.262%