We can as well generate any of the previous reports using a randomly selected subset of the database, for example to select 50% of motifs at random and calculate the number of aminoacids per coordinate:
./main -d ../../database/elm_input.txt -c -m ../../database/marked_motifs.txt -r 1 -a 0.5
We can as well select a random sample of motifs, and in each of them uniformly at random we select uniformly at random a coordinate and set it to wildcard "." with the option -b %. For example if we select a motif class to modify with 0.5 of probability and the, over this modified datbaase we calculate the number of aminoacids per coordinate:
Each python script has an input directoy and an output directory. To excecute the each python script it is enought to be inside the script folder and run 'python main.py' which will generate the output and place it in output directory. The rule is the script that has "empiric" in is name should have "structure_empiric_frequency.txt" in its input folder, while the script that has "theoretic" in its name should have "structure_theoretic_probabilities.txt" in its input folder.
Each python script has an input directoy and an output directory. To excecute the each python script it is enought to be inside the script folder and run 'python main.py' which will generate the output and place it in output directory. The rule is the script that has "empiric" in is name should have "structure_empiric_frequency.txt" in its input folder, while the script that has "theoretic" in its name should have "structure_theoretic_probabilities.txt" in its input folder.