INDEX
Explanations
concepts related to authenticity and connection to history
New Auto-Interp
Negative Logits
331
-0.17
at
-0.16
ivo
-0.15
lick
-0.15
aba
-0.15
une
-0.15
ÙĪØ·
-0.14
ey
-0.14
zej
-0.14
viá»ĩc
-0.14
POSITIVE LOGITS
impact
0.21
impact
0.20
Impact
0.19
meaning
0.18
significance
0.18
characteristics
0.17
Impact
0.17
relevance
0.16
bearing
0.16
appeal
0.16
Activations Density 0.244%