INDEX
Explanations
mentions of a specific entity or organization called "Ent"
the word "Entity" in various contexts
New Auto-Interp
Negative Logits
士
-0.78
strap
-0.74
Archangel
-0.72
phrine
-0.67
tiss
-0.66
manship
-0.64
ned
-0.63
CPC
-0.61
sterling
-0.61
Atmospheric
-0.61
POSITIVE LOGITS
itled
1.22
ropy
1.21
rance
1.11
rants
1.07
raction
1.03
reprene
1.00
ipt
0.96
race
0.93
ire
0.92
inct
0.91
Activations Density 0.009%