INDEX
Explanations
proper nouns or names of people or places
references to specific individuals and entities
New Auto-Interp
Negative Logits
rote
-0.78
actionDate
-0.68
geon
-0.62
redits
-0.61
izons
-0.61
eport
-0.60
@#&
-0.60
wered
-0.58
ledged
-0.58
ãĤ¨ãĥ«
-0.57
POSITIVE LOGITS
being
1.72
becoming
1.66
having
1.59
needing
1.49
getting
1.48
owning
1.45
gaining
1.40
discovering
1.40
losing
1.38
finding
1.38
Activations Density 0.515%