INDEX
Explanations
proper noun phrases that could refer to historical events, items, or individuals
Negative Logits
achus      -0.79
osate      -0.76
cation     -0.70
ob         -0.70
onis       -0.70
heit       -0.69
za         -0.67
posal      -0.67
ë          -0.67
fecture    -0.67
Positive Logits
kinds           1.13
sorts           1.07
facts           1.00
types           0.95
truths          0.94
sentiments      0.91
fellows         0.90
thoughts        0.90
qualities       0.88
distinctions    0.87
Activations Density 0.754%
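
The page does not say how the density figure is computed. A minimal sketch, assuming it is the percentage of sampled tokens on which the feature's activation is above zero; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and not taken from the source.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires at all; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    # Count tokens on which the feature is active.
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Made-up activations for eight tokens; the feature fires on two of them.
sample = [0.0, 0.0, 1.2, 0.0, 0.0, 0.0, 0.4, 0.0]
density = activation_density(sample)
print(f"{density:.3f}%")
```

Under this reading, a density of 0.754% would mean the feature fires on roughly 7 or 8 of every 1,000 sampled tokens.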