# My DI / Dependency Injection Library ## Brief Description and Main Features A Rust Dependency Injection (DI) library focused on simplicity and composability. Key features include: * Simple design using macros * Support for cyclic dependencies * Support for arbitrary initialization order * Working with dyn traits * Ability to use multiple structures with the same types through tagging * Usage of default traits and arbitrary functions for default arguments * The ability to not only assemble classes but also disassemble classes into components. For example, for use with configurations. This library streamlines the management of complex projects with numerous nested structures by organizing the assembly and integration of various application components, such as configurations, database connections, payment service clients, Kafka connections, and more. While not providing these components directly, the library significantly simplifies the organization and management of your application's structure if it consists of such elements. My DI ensures dependency management remains organized, easy to read, and expandable, laying a solid foundation for the growth of your project. ## How to connect the library? Simply add the dependency to your Cargo.toml: ```toml [dependencies] mydi = "0.1.2" ``` ## So, what's the problem? Why do I need this? Approaches using separate mechanisms for DI are common in other languages like Java and Scala, but not as widespread in Rust. To understand the need for this library, let's look at an example without My DI and one with it. Let's build several structures (Rust programs sometimes consist of hundreds of nested structures) in plain Rust. ### The Problem! ```rust struct A { x: u32 } impl A { pub fn new(x: u32) -> Self { Self { x } } } struct B { x: u64 } impl B { pub fn new(x: u64) -> Self { Self { x } } } struct C { x: f32 } impl C { pub fn new(x: f32) -> Self { Self { x } } } struct D { a: A, b: B, c: C } impl D { pub fn new(a: A, b: B, c: C) -> Self { Self { a, b, c } } pub fn run(self) { todo!() } } fn main() { let a = A::new(1); let b = B::new(2); let c = C::new(3.0f32); let d = D::new(a, b, c); d.run() } ``` As you can see, we write each argument in at least 4 places: * in the struct declaration, * in the constructor arguments, * in the structure fields in the constructor, * and then also substitute the arguments in the constructor. And as the project grows, all of this will become more complex and confusing. ### The Solution! Now let's try to simplify all this with My DI: ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct A { x: u32 } #[derive(Component, Clone)] struct B { x: u64 } #[derive(Component, Clone)] struct C { x: f32 } #[derive(Component, Clone)] struct D { a: A, b: B, c: C } impl D { pub fn run(self) { todo!() } } fn main() -> Result<(), Box> { let injector = InjectionBinder::new() .instance(1u32) .instance(2u64) .instance(3f32) .inject::() .inject::() .inject::() .inject::() .build()?; let d: D = injector.get()?; d.run() } ``` As a result, we reduced the amount of code, removed unnecessary duplication, and left only the essential code. We also opened ourselves up to further code refactoring (which we will discuss in the following sections): 1. We can now separate the structure building across different files and not drag them into a single one; for example, we can separately assemble configurations, database work, payment service clients, Kafka connections, etc. 2. We can assemble them in any order, not just in the order of initialization, which means we don't have to keep track of what was initialized first. 3. We can work with cyclic dependencies. # Testing Dependencies The library resolves dependencies at runtime, as otherwise, it would be impossible to implement features like cyclic dependencies and arbitrary initialization order. This means that dependency resolution needs to be checked somehow, and for this purpose, a test should be added. This is done very simply. To do this, you just need to call the verify method. In general, it's enough to call it after the final assembly of dependencies. For example, like this: ```rust use mydi::{InjectionBinder, Component}; fn build_dependencies(config: MyConfig) -> InjectionBinder<()> { todo!() } #[cfg(test)] mod test_dependencies_building { use std::any::TypeId; use sea_orm::DatabaseConnection; use crate::{build_dependencies, config}; use std::collections::HashSet; #[test] fn test_dependencies() { let cfg_path = "./app_config.yml"; let app_config = config::parse_config(&cfg_path).unwrap(); let modules = build_dependencies(app_config); let initial_types = HashSet::from([ // types that will be resolved somewhere separately, but for the purposes of the test, we add them additionally TypeId::of::(), TypeId::of::() ]); // the argument true means that in the errors, we will display not the full names of the structures, but only the final ones // if you are interested in the full ones, you should pass false instead modules.verify(initial_types, true).unwrap(); } } ``` # Modular Architecture and Composition ## Organizing files and folders How to organize a project with many dependencies? It may depend on your preferences, but I prefer the following folder structure: ``` - main.rs - modules -- mod.rs -- dao.rs -- clients.rs -- configs.rs -- controllers.rs -- services.rs -- ... ``` This means that there is a separate folder with files for assembling dependencies, each responsible for its own set of services in terms of functionality. Alternatively, if you prefer, you can divide the services not by functional purpose, but by domain areas: ``` - main.rs - modules -- mod.rs -- users.rs -- payments.rs -- metrics.rs -- ... ``` Both options are correct and will work, and which one to use is more a matter of taste. In each module, its own `InjectionBinder` will be assembled, and in main.rs, there will be something like: ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct MyApp {} impl MyApp { fn run(&self) { todo!() } } fn merge_dependencies() -> Result, Box> { let result = modules::dao::build_dependencies() .merge(modules::configs::build_dependencies()?) // Of course, during dependency assembly, something might fail .merge(modules::clients::build_dependencies()) .merge(modules::services::build_dependencies()) .merge(modules::controllers::build_dependencies()) .inject::(); Ok(result) } fn main() -> Result<(), Box> { let injector = merge_dependencies()?.build()?; let app: MyApp = injector.get()?; app.run(); Ok(()) } ``` ## Organizing a separate module So, how will the modules themselves look? This may also depend on personal preferences. I prefer to use configurations as specific instances. ```rust use mydi::InjectionBinder; pub fn build_dependencies(app_config_path: &str, kafka_config_path: &str) -> Result> { let app_config = AppConfig::parse(app_config_path)?; let kafka_config = KafkaConfig::parse(kafka_config_path)?; let result = InjectionBinder::new() .instance(app_config) .instance(kafka_config) // ... .void(); Ok(result) } ``` Meanwhile, the module for controllers might be assembled like this: ```rust use mydi::{InjectionBinder, Component}; pub fn build_dependencies() -> InjectionBinder<()> { InjectionBinder::new() .inject::() .inject::() .inject::() // ... .void() } ``` Note the `.void()` at the end. After each component is added to the `InjectionBinder`, it changes its internal type to the one that was passed. Therefore, to simplify working with types, it makes sense to convert to the type `()`, and that's what the `.void()` method is used for. # Adding Dependencies Using Macros To add dependencies, the best way is to use the derive macro Component: ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct A { x: u32, y: u16, z: u8, } ``` It will generate the necessary `ComponentMeta` macro, and after that, you can add dependencies through the inject method: ```rust fn main() -> Result<(), Box> { let injector = InjectionBinder::new() .instance(1u32) .instance(2u16) .instance(3u8) .inject::() .build()?; todo!() } ``` # Adding Dependencies Using Functions In some cases, using macros may be inconvenient, so it makes sense to use functions instead. For this, use the `inject_fn` method: ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct A { x: u32, } #[derive(Component, Clone)] struct B { a: A, x: u32, } #[derive(Clone)] struct C { b: B, a: A, x: u64, } fn main() -> Result<(), Box> { let inject = InjectionBinder::new() .inject_fn(|(b, a, x)| C { b, a, x }) .inject::() .inject::() .instance(1u32) .instance(2u64) .build()?; let x = inject.get::()?; } ``` Take note of the parentheses in the arguments. The argument here accepts a tuple. Therefore, for 0 arguments, you need to write the arguments like this `|()|`, and for a single argument, you need to write the tuple in this form `|(x, )|`. # Default Arguments To add a default value, you can use the directive `#[component(...)]`. Currently, there are only 2 available options: `#[component(default)]` and `#[component(default = my_func)]`, where my_func is a function in the scope. `#[component(default)]` will substitute the value as `Default::default()` For example, like this: ```rust #[derive(Component, Clone)] struct A { #[component(default)] x: u32, #[component(default = custom_default)] y: u16, z: u8, } fn custom_default() -> u16 { todo!() } ``` Note that custom_default is called without parentheses `()`. Also, at the moment, calls from nested modules are not supported, meaning `foo::bar::custom_default` will not work. To work around this limitation, simply use `use` to bring the function call into scope. # How to read values? As a result of dependency assembling, an injector is created, from which you can obtain the dependencies themselves. Currently, there are 2 ways to get values: getting a single dependency and getting a tuple. ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct A {} #[derive(Component, Clone)] struct B {} fn main() -> Result<(), Box> { let injector = InjectionBinder::new() .inject::() .inject::() .build(); let a: A = injector.get()?; // getting a single value let (a, b): (A, B) = injector.get_tuple()?; // getting a tuple todo!() } ``` Currently, tuples up to dimension 18 are supported. # Generics Generics in macros are also supported, but with the limitation that they must implement the `Clone` trait and have a `'static` lifetime: ```rust use mydi::{InjectionBinder, Component}; #[derive(Component, Clone)] struct A { x: u32 } fn main() -> Result<(), Box> { let injector = InjectionBinder::new() .instance(1u32) .instance(2u64) .inject::>() .inject::>() .build()?; let a: A = injector.get::>()?; todo!() } ``` # Circular Dependencies In some complex situations, there is a need to assemble circular dependencies. In a typical situation, this leads to an exception and a build error. But for this situation, there is a special Lazy type. It is applied simply by adding it to the inject method: ```rust use mydi::{InjectionBinder, Component, Lazy}; #[derive(Component, Clone)] struct A { x: Lazy } struct B { x: A, y: u32 } fn main() -> Result<(), Box> { let injector = InjectionBinder::new() .instance(1u32) .inject::() .inject::() .inject::>() .build()?; let a: A = injector.get::()?; todo!() } ``` Also, it's worth noting that nested lazy types are prohibited # Working with dyn traits In some cases, it makes sense to abstract from the type and work with Arc or Box. For these situations, there is a special auto trait and erase! macro. For example, like this: ```rust use mydi::{InjectionBinder, Component, erase}; #[derive(Component, Clone)] pub struct A { x: u32, } trait Test { fn x(&self) -> u32; } impl Test for A { fn x(&self) -> u32 { self.x } } fn main() -> Result<(), Box> { let inject_res = InjectionBinder::new() .inject::().auto(erase!(Arc)) .instance(1u32) .build()?; let dyn_type = inject_res.get::>()?; } ``` What's happening here? `auto` is simply adding a new dependency based on the previous type without adding it to the InjectionBinder's type. In other words, you could achieve the same effect by writing `.inject_fn(|(x, )| -> Arc { Arc::new(x) })`, but doing so would require writing a lot of boilerplate code, which you'd want to avoid. Why might we need to work with `dyn traits`? One reason is to abstract away from implementations and simplify the use of mocks, such as those from the [ mockall](https://github.com/asomers/mockall) library. But if you need to use something like `Box` instead of `Arc`, you need to use the library [ dyn-clone](https://github.com/dtolnay/dyn-clone) ```rust use mydi::{InjectionBinder, Component}; use dyn_clone::DynClone; #[derive(Component, Clone)] pub struct A { x: u32, } trait Test: DynClone { fn x(&self) -> u32; } dyn_clone::clone_trait_object!(Test); impl Test for A { fn x(&self) -> u32 { self.x } } fn main() -> Result<(), Box> { let inject_res = InjectionBinder::new() .inject::().auto(erase!(Box)) .instance(1u32) .build()?; let dyn_type = inject_res.get::>()?; } ``` # Autoboxing Since we store type information inside InjectionBinder, we can automatically create implementations for the type T for containers Arc and Box using the methods .auto_arc() and .auto_box(). ```rust #[derive(Component, Clone)] struct MyStruct {} #[derive(Component, Clone)] struct MyNestedStruct { my_struct_box: Box, my_struct_arc: Arc, } fn main() -> Result<(), Box> { let inject_res = InjectionBinder::new() .inject::().auto_box().auto_arc() .inject::() .build()?; } ``` Also, if there is a Component annotation, then the type inside Arc<...> can be passed directly to the inject method. For example, like this: ```.inject>``` It is important to note that the original type will still be available and will not be removed. # Duplicate Dependencies and Tagging In some situations, it is necessary to use multiple instances of the same type, but by default, the assembly will fail with an error if two identical types are passed. However, this may sometimes be necessary, for example, when connecting to multiple Kafka clusters, using multiple databases, etc. For these purposes, you can use generics or tagging. Example using generics: ```rust #[derive(Component, Clone)] struct MyService { config: KafkaConfig } fn main() -> Result<(), Box> { let config1: Config1 = todo!(); let config2: Config2 = todo!(); let inject_res = InjectionBinder::new() .inject::>() .inject::>() .instance(config1) .instance(config2) .build()?; todo!() } ``` You can also use tagging. For this purpose, there is a special Tagged structure that allows you to wrap structures in tags. For example, like this: ```rust // This type will be added to other structures #[derive(Component, Clone)] struct MyKafkaClient {} // These are tags, they do not need to be created, the main thing is that there is information about them in the type struct Tag1; struct Tag2; #[derive(Component, Clone)] struct Service1 { kafka_client: Tagged } #[derive(Component, Clone)] struct Service2 { kafka_client: Tagged } fn main() -> Result<(), Box> { let client1: Tagged = { let client1 = todo!(); Tagged::new(client1) }; let client2: Tagged = { let client2 = todo!(); Tagged::new(client2) }; let inject_res = InjectionBinder::new() .inject::() .inject::() .instance(client1) .instance(client2) .build()?; } ``` The Tagged type implements std::ops::Deref, which allows you to directly call methods of the nested object through it. # Expanding # Basic Expanding It's also possible not only to assemble classes but also to disassemble them into components. This can be useful in situations with configuration structs. For instance, if we have a tree of objects, we can automatically inject objects of nested struct fields. ```rust #[derive(Clone, mydi::ComponentExpander)] struct ApplicationConfig { http_сonfig: HttpConfig, cache_сonfig: CacheConfig } #[derive(Clone)] struct HttpConfig { port: u32, host: String } #[derive(Clone)] struct CacheConfig { ttl: std::time::Duration, size: usize } fn main() -> Result<(), Box> { let config1: ApplicationConfig = todo!(); let inject_res = InjectionBinder::new() .expand(config1) // .instance(config1.http_сonfig) these two substitutions will be done inside expand // .instance(config1.cache_сonfig) .build()?; todo!() } ``` # Ignoring fields during Expanding In some cases, we want to inject not all fields, but only some of them. For these scenarios, use the directive `#[ignore_expand]` ```rust #[derive(Clone, mydi::ComponentExpander)] struct ApplicationConfig { http_сonfig: HttpConfig, #[ignore_expand] // this field will now not be injected cache_сonfig: CacheConfig } #[derive(Clone)] struct HttpConfig { port: u32, host: String } #[derive(Clone)] struct CacheConfig { ttl: std::time::Duration, size: usize } fn main() -> Result<(), Box> { let config1: ApplicationConfig = todo!(); let inject_res = InjectionBinder::new() .expand(config1) // .instance(config1.http_сonfig) this substitution will be done inside expand .build()?; todo!() } ``` # Nested expanding You can also expand nested structures. To do this, use the annotation `#[nested_expand]`. It's important to note that the structure itself will not be expanded. However, if you need to expand it, you can use the `#[force_expand]` annotation. ```rust #[derive(Clone, mydi::ComponentExpander)] struct ApplicationConfig { http_сonfig: HttpConfig, #[nested_expand] /// the fields of this structure will be injected, but the structure itself won't be // #[force_expand] if you uncomment this annotation, this field will also be injected logic_config: BusinessLogicConfigs } #[derive(Clone, mydi::ComponentExpander)] struct NestedExpanding { port: u32, host: String } #[derive(Clone, mydi::ComponentExpander)] struct BusinessLogicConfigs { some_logic: LogicConfigs // this structure will be injected } #[derive(Clone)] struct LogicConfigs { } fn main() -> Result<(), Box> { let config1: ApplicationConfig = todo!(); let inject_res = InjectionBinder::new() .expand(config1) // .instance(config1.http_сonfig) this substitution will be done inside expand .build()?; todo!() } ``` # Limitations Current implementation limitations: * All types must be 'static and must implement Clone * Heap is heavily used, so no_std usage is not yet possible * It is worth noting that there can be multiple copies made at the moment of building dependencies, which should not be critical for most long-lived applications, and based on basic tests, it is performed 1-2 orders of magnitude faster than simple config parsing. # Licensing Licensed under either of Apache License, Version 2.0 or MIT license at your option. # Contribution Any contribution is welcome. Just write tests and submit merge requests # Roadmap - [ ] Better handling of default values - [ ] Add Cargo features - [ ] Add ahash support - [ ] Custom errors # Special thanks to * Chat GPT-4, which helped me write all this documentation and correct a huge number of errors in the code * Kristina, who was my inspiration * Numerous libraries in Java, Scala, and Rust that I used as references * Library authors, you are the best * Stable Diffusion, which helped me to create logo :-) # Related projects ## rust * [inject](https://docs.rs/inject/latest/inject/) * [teloc]( https://github.com/p0lunin/teloc) * [shaku]( https://github.com/AzureMarker/shaku ) * [waiter]( https://github.com/dmitryb-dev/waiter) ## java * [guice](https://github.com/google/guice) * [spring](https://github.com/spring-projects/spring-framework) ## scala * [macwire](https://github.com/softwaremill/macwire) * [izumi](https://github.com/7mind/izumi)