INDEX
Explanations
locations or objects in a specific context
instances of discovery or observation in a narrative context
New Auto-Interp
Negative Logits
vic
-0.77
ña
-0.73
nom
-0.64
elector
-0.64
ñ
-0.62
endeav
-0.61
atre
-0.60
ivil
-0.60
ico
-0.59
spirit
-0.58
POSITIVE LOGITS
ODY
0.90
EMOTE
0.80
similarities
0.78
Suddenly
0.72
something
0.71
IRT
0.70
clues
0.69
anew
0.66
glimps
0.66
Finding
0.66
Activations Density 0.203%