INDEX
Explanations
references to the city of Tel Aviv
references to the Telugu film industry
New Auto-Interp
Negative Logits
subp
-0.65
fronts
-0.60
measures
-0.59
grave
-0.59
lihood
-0.58
Fet
-0.57
PID
-0.56
ONSORED
-0.56
semester
-0.56
judging
-0.55
POSITIVE LOGITS
Aviv
1.49
ugu
1.24
estial
1.17
eno
1.09
stra
1.03
ibia
0.99
ford
0.98
icon
0.93
edy
0.92
esis
0.92
Activations Density 0.027%