The treestats package provides an easy to use interface to calculate summary statistics on phylogenetic trees. To obtain a list of all supported summary statistics use:
## [1] "area_per_pair" "average_leaf_depth" "avg_ladder"
## [4] "avg_vert_depth" "b1" "b2"
## [7] "beta" "blum" "cherries"
## [10] "colless" "colless_corr" "colless_quad"
## [13] "crown_age" "diameter" "double_cherries"
## [16] "eigen_centrality" "eigen_centralityW" "ew_colless"
## [19] "four_prong" "gamma" "i_stat"
## [22] "il_number" "imbalance_steps" "j_one"
## [25] "j_stat" "laplace_spectrum_a" "laplace_spectrum_e"
## [28] "laplace_spectrum_g" "laplace_spectrum_p" "max_adj"
## [31] "max_betweenness" "max_closeness" "max_closenessW"
## [34] "max_del_width" "max_depth" "max_ladder"
## [37] "max_laplace" "max_width" "mean_branch_length"
## [40] "mean_branch_length_ext" "mean_branch_length_int" "min_adj"
## [43] "min_laplace" "mntd" "mpd"
## [46] "mw_over_md" "nltt_base" "number_of_lineages"
## [49] "phylogenetic_div" "pigot_rho" "pitchforks"
## [52] "psv" "rogers" "root_imbalance"
## [55] "rquartet" "sackin" "stairs"
## [58] "stairs2" "symmetry_nodes" "tot_coph"
## [61] "tot_internal_path" "tot_path" "tree_height"
## [64] "treeness" "var_branch_length" "var_branch_length_ext"
## [67] "var_branch_length_int" "var_depth" "vpd"
## [70] "wiener"
If your favourite summary statistic is missing, please let the maintainer know, treestats is a dynamic package always under development, and the maintainers are always looking for new statistics!
Given a phylogenetic tree, you can now use of the available functions to calculate your summary statistic of choice. Let’s take for instance the Colless statistic (and we generate a dummy tree):
## [1] 324
Looking at the documentation of the colless statistic
(?colless
), we find that the function also includes options
to normalize for size: either ‘pda’ or ‘yule’:
## [1] -0.2492387
The treestats package supports calculating many statistics in one go.
For this, several functions have been set up aptly. Firstly, the
function calc_all_stats
will calculate all statistics:
Similarly, we can also blanket apply all topology associated summary statistics:
## area_per_pair average_leaf_depth avg_ladder avg_vert_depth
## 1.333010e+01 8.060000e+00 2.500000e+00 7.105528e+00
## b1 b2 beta blum
## 4.966704e+01 5.533081e+00 -3.094604e-01 1.193234e+02
## cherries colless colless_corr colless_quad
## 3.000000e+01 3.240000e+02 6.679035e-02 4.716000e+03
## diameter double_cherries eigen_centrality ew_colless
## 2.300000e+01 3.000000e+00 2.607981e-01 5.252008e-01
## four_prong i_stat il_number imbalance_steps
## 9.000000e+00 5.871037e-01 4.000000e+01 8.500000e+01
## j_one max_betweenness max_closeness max_del_width
## 8.242998e-01 1.211900e+04 NA 1.000000e+01
## max_depth max_ladder max_width mw_over_md
## 1.400000e+01 4.000000e+00 3.200000e+01 2.285714e+00
## pitchforks rogers root_imbalance rquartet
## 1.400000e+01 6.600000e+01 6.500000e-01 4.371309e+06
## sackin stairs stairs2 symmetry_nodes
## 8.060000e+02 6.666667e-01 5.976075e-01 6.600000e+01
## tot_coph tot_internal_path tot_path_length var_depth
## 6.905000e+03 6.080000e+02 1.414000e+03 5.228687e+00