The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Normalization (z-transform) formula?
hi,
I am a bit confused what the true formula for z-transform normalization in rapidminer is, normally it is (X-arithmeticMean(X))/std.deviation
but what is the standard-deviation formula? one like that:
https://docs.tibco.com/pub/spotfire/6.5.0/doc/html/norm/norm_z_score.htm
or one used in studentizing:
https://de.wikipedia.org/wiki/Studentisierung
or sample variance?
https://de.wikipedia.org/wiki/Stichprobenvarianz
and what do I use if I don't know the real arithmetic mean of my values X or the probability distribution of my values of X ?
because variance uses Expectation values from X, and its variance = SUM(p(x)*X)
0
Answers
Fred,
i totally do not get the question. You are doing data mining, so you always work on estimates of it. Thats the difference between Erwartungswert and Mittelwert in german. The only question is wether you correct with the -1 or not.
~Martin
Dortmund, Germany
ok then I was getting something wrong
when do I apply -1 correction, and when not? where is the difference?
I think the correct way is to use it with -1. That makes it a unbiased estimator (erwartungstreuer schätzer) for the true std. dev.
Makes no difference for high n though.
Dortmund, Germany