INDEX
Explanations
concepts or ideas related to abstract or philosophical topics
phrases related to definitions and interpretations of concepts or issues
New Auto-Interp
Negative Logits
dayName
-0.68
etz
-0.64
Seat
-0.63
)]
-0.61
ktop
-0.60
wcs
-0.59
IER
-0.59
heels
-0.58
depended
-0.58
orah
-0.57
POSITIVE LOGITS
tnc
0.82
ãģĻ
0.76
reality
0.76
rationality
0.73
rium
0.73
Marxism
0.72
sorts
0.72
Christianity
0.72
thood
0.71
course
0.71
Activations Density 0.180%