INDEX
Explanations
instances of words related to intense emotions or actions
names and identifying terms related to specific people or entities
Negative Logits
rast      -0.82
Ģ         -0.79
į         -0.76
Frames    -0.75
ersed     -0.75
apon      -0.74
ADRA      -0.73
Kuro      -0.73
apons     -0.73
ESH       -0.72
Positive Logits
stown     0.74
wills     0.74
mint      0.73
enture    0.70
ithing    0.69
Yards     0.67
Belt      0.65
idges     0.65
tongues   0.65
inh       0.65
Activations Density 0.059%