INDEX
Explanations
names of places or entities
references to examples and comparisons
Negative Logits
imperialist   -0.85
ascus         -0.82
Whilst        -0.78
ocaust        -0.77
respective    -0.77
Sovere        -0.76
secution      -0.75
Conclusion    -0.74
ocide         -0.74
Therefore     -0.69
Positive Logits
brainstorm     0.80
Craigslist     0.77
perk           0.74
Yelp           0.74
               0.74
broch          0.73
favorites      0.72
classmate      0.72
personalized   0.72
webcam         0.71
Activations Density: 1.778%