Explanations
phrases that indicate the presence and status of various conditions or issues
"are" followed by an adjective
"are" followed by an adjective or adverb
Negative Logits
Token      Logit
its        -1.42
Its        -1.15
Its        -1.13
itself     -1.06
its        -1.01
它的       -0.95
它         -0.91
itself     -0.78
it         -0.74
которое    -0.74
Positive Logits
Token        Logit
themselves   1.67
themselves   1.55
themſelves   1.37
those        1.16
serem        1.14
ones         1.12
those        1.10
amelyek      1.08
которые      1.05
Those        1.05
Activations Density: 1.068%
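
A minimal sketch of how an activation density figure like the one above might be computed, assuming it means the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the toy data are all illustrative assumptions and are not taken from this page.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions where the
    feature fires at all; the page itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: 1000 token positions, 11 of which are active.
toy = [0.0] * 989 + [0.5] * 11
print(f"{activation_density(toy):.3%}")
```

Under this reading, a density of 1.068% would mean the feature fires on roughly one token in every hundred of the sampled text.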