INDEX
Explanations
references to philosophy and ethical discussions
Negative Logits
כז                  -0.65
.                   -0.65
Martinez            -0.64
Occurrences         -0.61
佼                  -0.60
ContentAlignment    -0.60
microb              -0.57
CENT                -0.57
phen                -0.56
יי                  -0.56
Positive Logits
Philosophy      1.23
Philosophy      1.22
philosopher     1.22
philosophers    1.20
philosoph       1.20
philosophical   1.16
filosof         1.16
philosophy      1.16
Philosopher     1.14
philosophies    1.12
Activations Density 0.105%