INDEX
Explanations
phrases indicating something is under consideration or influence
instances of the word "subject" in various contexts
Negative Logits
yip      -0.67
ERROR    -0.64
cia      -0.61
owler    -0.59
ces      -0.58
ttp      -0.57
hetto    -0.56
SER      -0.55
cane     -0.55
gian     -0.54
Positive Logits
ively      1.38
ivity      1.33
ivist      1.23
ivities    1.15
matter     1.09
ivism      1.09
ive        1.08
ion        1.05
ivation    1.02
ivating    0.99
Activations Density: 0.032%