INDEX
Explanations
words related to physical features and attributes
concepts related to organization and structure
Negative Logits
Token       Logit
cale        -0.73
ynski       -0.72
uden        -0.72
Tribunal    -0.64
nesday      -0.62
mma         -0.61
scl         -0.59
Il          -0.58
aurus       -0.58
Gazette     -0.56
Positive Logits
Token       Logit
busters     0.94
lessly      0.94
wise        0.88
less        0.86
breakers    0.84
able        0.75
ishly       0.75
ably        0.74
iques       0.74
ically      0.73
Activations Density: 0.705%