INDEX
Explanations
references to "television."
New Auto-Interp
Negative Logits
echip
-0.43
nevoie
-0.42
лтамалар
-0.41
neutre
-0.41
Beteiligung
-0.40
semn
-0.40
viață
-0.39
Wissenschaften
-0.39
păr
-0.39
picioare
-0.39
POSITIVE LOGITS
television
0.84
Television
0.75
television
0.73
Television
0.69
tv
0.66
Tv
0.65
TV
0.64
createState
0.60
insertion
0.60
insertions
0.59
Activations Density 0.271%