INDEX
Explanations
references to political figures and countries
references to numeric values and measurements associated with technology
New Auto-Interp
Negative Logits
tranquil
-0.75
heit
-0.65
ĸļ
-0.64
Quantity
-0.62
URR
-0.62
cla
-0.62
Cyrus
-0.61
mistrust
-0.59
utsche
-0.59
saddened
-0.58
POSITIVE LOGITS
iev
0.82
peat
0.77
ateral
0.76
ilver
0.76
idon
0.72
GHz
0.71
OD
0.71
handled
0.70
thodox
0.69
teness
0.69
Activations Density 0.302%