# ❄ spaceflake.rs ❄

[![Crates.io Badge](https://img.shields.io/crates/v/spaceflake.svg)](https://crates.io/crates/spaceflake)
[![CI Badge](https://github.com/kkrypt0nn/spaceflake.rs/actions/workflows/ci.yml/badge.svg)](https://github.com/kkrypt0nn/spaceflake.rs/actions)
[![Dependency Status Badge](https://deps.rs/repo/github/kkrypt0nn/spaceflake.rs/status.svg)](https://deps.rs/repo/github/kkrypt0nn/spaceflake.rs)
[![Discord Server Badge](https://img.shields.io/discord/739934735387721768?logo=discord)](https://discord.gg/mTBrXyWxAF)
[![Last Commit Badge](https://img.shields.io/github/last-commit/kkrypt0nn/spaceflake.rs)](https://github.com/kkrypt0nn/spaceflake.rs/commits/main)
[![Conventional Commits Badge](https://img.shields.io/badge/Conventional%20Commits-1.0.0-%23FE5196?logo=conventionalcommits&logoColor=white)](https://conventionalcommits.org/en/v1.0.0/)

---

A distributed generator to create unique IDs with ease; inspired by [Twitter's Snowflake](https://github.com/twitter-archive/snowflake/tree/snowflake-2010). A blog post about this project can be found [here](https://krypton.ninja/Generating-unique-IDs-with-the-Snowflake-algorithm/).

## What is a Snowflake?

Apart from being a crystal of snow, a snowflake is a form of unique identifier used in distributed computing. It has specific parts and is 64 bits long in binary. I simply named my type of snowflake a **Spaceflake**, as it is not composed of the same parts as a Twitter Snowflake and is being used for [Project Absence](https://github.com/ProjectAbsence) and other projects of mine.

### Structure

A Spaceflake is structured like the following:

![Parts of a 64 bits Spaceflake](https://raw.githubusercontent.com/kkrypt0nn/spaceflake.rs/main/assets/spaceflake_structure.png)

## Spaceflake Network

A Spaceflake Network is a very basic concept where you have multiple **independent nodes** that themselves consist of multiple workers. These workers are the ones that can generate a Spaceflake.

Ideally a Spaceflake Network represents your entire application, or company. Each node represents a single server or application within the company, and each worker represents a single process which can generate a Spaceflake for a specific purpose. This way you can easily identify where a Spaceflake was generated by looking at its node ID and worker ID.

In the end you are free to use them as you wish, just make sure you use these nodes and workers to be able to identify the Spaceflake.

### Example Network

An example network is structured like the following:

![A simple Spaceflake Network](https://raw.githubusercontent.com/kkrypt0nn/spaceflake.rs/main/assets/spaceflake_network.png)

We can consider **Node 1** as being the API/backend of your application. The **Worker (ID: 1)** would be responsible for generating Spaceflakes for user IDs.
The **Worker (ID: 2)** would be responsible for generating Spaceflakes for blog post IDs. **Node 2** might be responsible for the logs of your components, and the log IDs would be generated by the **Worker (ID: 1)** from that node.

## Some Statistics

* A Spaceflake network can hold up to **31 nodes** and **31 workers per node**. So you can have up to **961 workers in total** in a single network that will generate Spaceflakes.
* A **single worker** can generate up to **4095 Spaceflakes per millisecond**.
* A **single node** with **31 workers** can generate up to **126'945 Spaceflakes per millisecond**.
* A **single network** with **31 nodes** and **31 workers per node** can generate up to **3'935'295 Spaceflakes per millisecond**.

## Example

The basic example below creates a node and a worker, then lets the worker generate a Spaceflake. Using the generator **without** node and worker objects is also possible, though it is not recommended.

```rust
fn main() {
    // Create a node, then a worker that belongs to it
    let mut node = spaceflake::Node::new(1);
    let mut worker = node.new_worker();

    // Let the worker generate a Spaceflake and print its decomposed parts
    let sf = worker.generate();
    match sf {
        Ok(mut value) => {
            println!("Generated Spaceflake: {:#?}", value.decompose())
        }
        Err(error) => {
            println!("Error: {}", error)
        }
    }
}
```

Some other examples:

- [Bulk generation](examples/bulk.rs): Generate multiple Spaceflakes at once.
- [Generation with nodes and workers](examples/basic.rs): Generate a Spaceflake by creating a node and worker object.
- [Generation with settings](examples/generate.rs): Generate a Spaceflake with specific settings *(Not recommended, consider using nodes and workers)*

## Installation

If you want to use this library for one of your projects, you can install it like any other Rust library:

```shell
cargo add spaceflake
```

## ⚠️ Disclaimers

### Spaceflakes are Big Numbers

> 📜 TL;DR: If you use Spaceflakes in an API, return them as a **string**, not a number.
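
As a rough illustration of the TL;DR above, the following sketch uses only the Rust standard library and does not call any `spaceflake` API. It assumes the API client stores numbers as 64-bit floats, as JavaScript does. The helper names `id_to_api_string`, `id_from_api_string` and `through_f64` are hypothetical and exist only for this example.

```rust
use std::num::ParseIntError;

/// Hypothetical helper: render an ID as a decimal string for an API response.
fn id_to_api_string(id: u64) -> String {
    id.to_string()
}

/// Hypothetical helper: parse an ID received as a string back into a `u64`.
fn id_from_api_string(raw: &str) -> Result<u64, ParseIntError> {
    raw.parse::<u64>()
}

/// Simulates a client that stores every number as a 64-bit float.
fn through_f64(id: u64) -> u64 {
    (id as f64) as u64
}

fn main() {
    let id: u64 = 144328692659220481;

    // The string round trip keeps every bit of the ID.
    let restored = id_from_api_string(&id_to_api_string(id)).expect("valid u64 string");
    assert_eq!(restored, id);

    // The float round trip cannot represent the lowest bits of the ID.
    assert_ne!(through_f64(id), id);
    println!("via string: {}, via f64: {}", restored, through_f64(id));
}
```

The string form is larger on the wire, but it is the only one of the two that the client can hand back unchanged.
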
Since Spaceflakes are big numbers, an API that returns JSON will most likely need to return the Spaceflake as a string. Otherwise you will lose some precision, which will most likely alter the sequence part of the Spaceflake. Example:

```json
{
  "id": 144328692659220480 // ID actually generated in Rust: 144328692659220481
}
```

The difference between the two numbers is not that big in the example above, though it plays a big role. The difference is not always the same, so you can't simply subtract it. JavaScript, for example, sees no difference between these two numbers:

```js
console.log(144328692659220480 == 144328692659220481) // true
```

You can return the Spaceflake as a string and convert it back to a `u64` when needed in your Rust code, for example with `str::parse::<u64>()`.

### "Random" Sequence Based on Time

> 📜 TL;DR: The sequence is not truly random, it is based on the time; and if you generate lots of Spaceflakes in the same
> millisecond, there is a chance that two Spaceflakes will end up being the same. Using **nodes and workers** is highly
> recommended.

When generating lots of Spaceflakes in a really short time and without using a worker, there is a chance that the same ID is generated twice. Consider making your program sleep for 1 millisecond between generations, or experiment with other delays. Example:

```rust
use spaceflake::Spaceflake;
use std::collections::HashMap;
use std::thread;
use std::time::Duration;

fn main() {
    // Keep track of every generated Spaceflake by its ID to detect duplicates
    let mut spaceflakes = HashMap::<u64, Spaceflake>::new();
    let settings = spaceflake::GeneratorSettings::default();
    for _ in 0..1000 {
        let sf = spaceflake::generate(settings);
        match sf {
            Ok(value) => {
                if spaceflakes.get(&value.id).is_some() {
                    panic!("Got the same Spaceflake ID twice")
                }
                spaceflakes.insert(value.id, value);
            }
            Err(error) => {
                println!("Error: {}", error)
            }
        }
        // When using random there is a chance that the sequence will be twice the same due to Rust's speed,
        // hence using a worker is better. We wait a millisecond to make sure it's different.
        thread::sleep(Duration::from_millis(1))
    }
}
```

In that case it is recommended to use the workers, as they do not use a random value as a sequence number, but an incrementing value. Another option would be to use the [bulk generator](examples/bulk.rs) to create lots of **unique** Spaceflakes at once. As a last resort you can replace the sequence with a better random number generator using the following:

```rust
settings.sequence = ... // Replace with your number generator
```

## License

This library was made with 💜 by Krypton and is under the [MIT](LICENSE.md) license.