INDEX
Explanations
phrases related to logical reasoning or logic
references to logic and reasoning in various contexts
Negative Logits
avez       -0.83
Volunte    -0.73
hold       -0.67
Shar       -0.65
emale      -0.65
ometown    -0.63
Leopard    -0.62
semble     -0.62
atern      -0.61
national   -0.61
Positive Logits
logic        0.98
underpin     0.88
matical      0.84
ical         0.81
SourceFile   0.81
reasoning    0.81
istically    0.80
matic        0.77
ophical      0.77
dictates     0.75
Activations Density: 0.024%