The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to add noises without adding attributes
I just want to add noise data on the basis of original attribute-set, without adding "random" attributes. I used "Add Noise" operator and set parameter "random attributes = 0", it definitely generates no random attributes, but also generates no noise data I want. Is there any other parameters I should take into account?
0
Answers
the noise generated for existing attributes is completely independent from the random attributes. You may set them to 0 as you did, without getting any problems for the noise itself. Did you take a look at the operator description and set the "attribute filter type" and at least one of the noise parameters (label noise, default attribute noise, noise)?
I just created a simple example, where two attributes (att1, att2) are generated and then are copied with their respective integer values (att1_int, att2_int) to make the effect of the noise clearly visible. Then noise is added to both of them, I set the "default attribute noise" parameter to 1.0 and the results show the noise added to att1_int and att2_int.
Perhaps I misunderstood the function provided by "Add Noise" operator. I just presumed that new data records which may be treated as "noise" will be added to the ExampleSet, for example, we have 10 data records before "Add Noise", then, when "Add Noise" is processed, extra data records (ext. 5) will be added, so the total record quantity is 15 now. Do I get the wrong message?
Thanks & Best Regards
I see what you expected from the operator but I think this is the wrong choice for your task. As the description says "Adds noise to existing attributes or add random attributes." you can just add attributes, no examples. To get some new examples into your existing set you could perhaps use "Generate (Nominal) Data" and a combination of "Rename" (to adjust the attributes) and "Append" or "Union" (does not require exactly the same attributes).
But what about the generated values? Do you have some generation rule to fill in some realistic data values? Shall your noise be some unwanted data without meaning or shall it represent some perturbation to wanted values/signals (due to technical issues as measuring inaccuracy or something similar)? In the first case you should be fine with the "Generate Data" approach and insert some nonsense data as noise (for example "spam" that must be filtered out from the real information). Otherwise I would suggest to apply noise (via "Add Noise") to real data. If you want your ExampleSet to be extendend by some noise examples you might copy existing examples, add noise to them and then merge both sets together.
If you have 10 examples and want to add 5 additional "noisy" examples you could apply a "Sample" operator and randomly choose 5 examples, apply noise to them and append them to the original set: Perhaps there exist other (better) ways to solve this problem, but I think this should do it as a start. Hope this helps somehow...
Regards,
Matthias
You are so kind to give me such a detailed interpretation.
It's definitely what I want to get.
So thank you very much to help me.
Thanks & Best Regards
ZHENG