Explanations
proper nouns
terms related to division or specificity within a context
Negative Logits
situational    -0.79
redundancy     -0.73
dime           -0.73
brim           -0.72
sacrific       -0.67
amnesty        -0.66
sovere         -0.66
sovereignty    -0.66
runway         -0.65
loop           -0.65
Positive Logits
milo           0.91
vous           0.87
sky            0.82
rey            0.82
lihood         0.81
ukong          0.80
furt           0.80
姫             0.80
theless        0.80
sson           0.80
Activations Density 0.164%