# Barnes-Hut Tree for Quick N-body Force Calculation Thank you for checking out the crate. This is Zhifeng's coding practice of implementing the Barnes-Hut Tree for accelerated N-body force calculation. The crate consists of two parts: the [BarnesHutTree] struct and the [utils] module containing a few helper functions. The [BarnesHutTree] is a data structure that accelerates the N-body force calculation by approximating and treating groups of "bodies" relatively "far" away as super nodes, not needing to loop through each individual body. To make the force calculation more customizable, the [BarnesHutTree] takes in a closure (Anonymous Function) determining whether a current node can be seen as "far enough" and a closure determining how to calculate the force or any other relationship between the target node and the super node. The [utils] module contains several closure factory functions of the repulsive force calculation mentioned in a ([paper](#reference)) about graph drawing. ## Usage Example ```rust use zhifeng_impl_barnes_hut_tree as zbht; use zbht::BarnesHutTree as BHTree; let k = 1.0; // Custom Parameters For Calculation let c = 0.2; let mut bht: BHTree<2> = BHTree::new(); bht.push(&[-1.0,1.0]); bht.push(&[1.0,1.0]); let mut ans_displacement = [0.0; 2]; let calc_fn = zbht::utils::factory_of_repulsive_displacement_calc_fn::<2>(k, c); // Since the closure implies every super node is not "far" enough, // the line calculates the exact displacement acting on value 0. let is_super_fn = |_: &[f64;2],_:&[f64;2],_:f64| -> bool {false}; // ignoring super nodes let calc_fn = zbht::utils::factory_of_repulsive_displacement_calc_fn::<2>(k, c); bht.calc_force_on_value(0, &is_super_fn, &calc_fn, &mut ans_displacement); assert_eq!(ans_displacement, [(-2.0 * k * k * c) / (2.0 * 2.0), 0.0]); let theta = 1.2; // Custom Parameter For Super Node Determination let is_super_fn = zbht::utils::factory_of_is_super_node_fn::<2>(theta); // Calculating the approximated displacement acting on value 0. bht.calc_force_on_value(0, &is_super_fn, &calc_fn, &mut ans_displacement); ``` ## Design and Naming To clarify the arguments in constructors, below are the key terms I used in the implementation. - The "body" in the context of "n-body" calculation is called `value`. (Currently, the tree treats each body as equal.) - The Barnes Hut Tree uses a tree of hypercubes to quickly find groups of nodes relatively close to each other and calculate the average position of the group of `value`s beforehand to accelerate the calculation process. - The average of a node's contained `value`s' position is called value center, `vc` for short. - The tree nodes' bounding box center, which is the center of a hypercube, is called `bc` for short. - The tree nodes' bounding box radius, which is the half-width of a hypercube, is called `br` for short. To make the design safer, I internally use `vec` to store nodes: internal and leaf nodes. When a node needs to be removed, if it is not the last node, the last node in `vec` will replace its place and update the index-based virtual "pointers". ### General Idea At first, the [BarnesHutTree] starts with a default or specified bounding hypercube. For each dimension, the hypercube's center divides the dimension into two, thus dividing the hyperspace into two to the power of a number of dimensions. For example, for two-dimensional cases, a node can divide all the values into four groups. The [BarnesHutTree] will try to recursively divide all the values until all the values are separated such that, finally, not two values directly belong to the same node. With this structure and some criteria about the relative position between a target node and the value center position of the super node, we can efficiently find out super nodes and calculate forces. But, there are many potential problems: how to handle values sitting on the boundaries, defining dimensions, handling adding values out of the initial bounding hypercube, and handling adding two too close or even identical values, which lead to infinite loops trying to separate them. ### Handling Boundary Values A hypercube's lower bound is inclusive, and its upper bound is exclusive. So, for example, for two-dimensional space, the value sitting on the center of a hypercube belongs to the node's first quadrant child node. ### Handling Dimensions The tree uses template parameters to define the dimension of the value (body)'s position. Internally, one internal node uses a `vec` to store information on its child nodes. This approach makes accessing the child nodes quick, but the space needed is two to the power of the number of dimensions. We need to be careful when using a large number of dimensions. ### Handling Out-of-root-bounding-range Values The tree will try to double its width by creating a new internal node with a larger radius in the direction of the to-include value (similarly for none or only one leaf node above the minimum radius limit). We need to be careful when the to-include value is too far away, creating numeric issues like "Infinity" or "NaN". ### Handling Too Close or Identical Values Since the tree is generally trying to put each value into separated leaves to calculate the super nodes, without additional checking and handling, trying to insert two identical values will create an infinite loop trying to put the two values into different leaf nodes. The tree currently handles two too-close values by allowing us to pre-set a limit bound that the tree node will not continue to divide when its hypercube's radius (half-width) is less than or equal to that bound. When calculating "force", the tree will loop through every value in the same leaf node except the target value itself. We need to be extra careful when setting this limit value. If the limit is too big, too many values will be held in a single leaf node, resulting in a decrease in efficiency. If all the values are in one single leaf node, the behavior and efficiency of the tree will be the same as looping through all the nodes: the default implementation of N-body calculation. Currently, even though the `f64` will eventually reach zero after some dividing, the limit should be larger than zero and be finite. Currently, the default limit is `1e-8`. ## Features ### Serialize This feature uses `serde` and `serde_json` to help serialize the tree for testing, debugging, and making visualizations. ### Unchecked This feature uses `get_unchecked`, `get_unchecked_mut`, etc, for quicker access of "virtual" "inside-vec" nodes. Based on the `cargo bench` results with `--features unchecked`, the "unchecked" feature is about 6% quicker than the default one. ## Performance The crate uses `criterion` for benchmarking and `rand` for generating random testing values. To simulate the common use cases of [BarnesHutTree], I used one round of looping through all the values, calculating their corresponding displacement and updating their positions as the benchmarking standard. | Algorithm | Number of Values (bodies) | Time | | --------------------------------------- | ------------------------- | --------- | | Double Nested Loop | 1000 | 2.44 ms | | [BarnesHutTree] | 1000 | 1.56 ms | | [BarnesHutTree] (feature = "unchecked") | 1000 | 1.46 ms | | Double Nested Loop | 10000 | 242.88 ms | | [BarnesHutTree] | 10000 | 25.09 ms | | [BarnesHutTree] (feature = "unchecked") | 10000 | 22.98 ms | ## Overall Panics The tree will panic if any 64-bit float becomes "NaN" or "Infinity" during construction and value updating processes. We should be careful with the numeric values of the values' positions before adding them to the tree. ## Reference Hu, Y. (2005). Efficient, high-quality force-directed graph drawing. _Mathematica journal, 10_(1), 37-71. I got the idea of the Barnes-Hut Tree from section four, "Barnes-Hut Force Calculation," of this paper. I have put some of my independent thoughts into my implementation (which might be error-prone). For example, in the paper, the author didn't focus on the "value removing" part of the Barnes-Hut Tree because rebuilding the octree structure is adequately efficient and accurate, so I thought about it independently. ## License: AGPL-3.0-only Zhifeng's Implementation of Barnes-Hut Tree For Accelerated N-body Calculation Copyright (C) 2024 Zhifeng Wang This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, version 3. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details. You should have received a copy of the GNU Affero General Public License along with this program. If not, see .