INDEX
Explanations
proper nouns or names
specific names and references to individuals or groups
New Auto-Interp
Negative Logits
hap
-0.90
pell
-0.66
ivably
-0.59
oven
-0.57
advertisement
-0.55
2020
-0.55
lov
-0.54
versus
-0.54
endif
-0.53
088
-0.52
POSITIVE LOGITS
taboola
0.67
tro
0.66
scrut
0.65
wont
0.60
Tata
0.57
',
0.57
tarians
0.57
',
0.56
Swed
0.56
enum
0.55
Activations Density 0.396%