Deployment and DevOps Standards for Rust

Security Best Practices Standards for Rust

Rust

# Security Best Practices Standards for Rust This document outlines security best practices for Rust development, providing guidelines for preventing common vulnerabilities and building secure applications. It is designed to be used by developers and AI coding assistants to ensure code adheres to high-security standards. ## 1. Input Validation and Sanitization ### 1.1. Standard: Validate All External Input **Do This:** Implement thorough input validation for all data entering your application, including user input, API responses, and data from files. **Don't Do This:** Assume that external data is safe or properly formatted without validation. **Why:** Insufficient input validation can lead to vulnerabilities such as injection attacks (SQL, command injection), buffer overflows, and denial-of-service (DoS). **Code Example:** """rust use serde::Deserialize; #[derive(Deserialize)] struct UserInput { username: String, age: u32, } fn process_input(input: &str) -> Result<(), String> { let user_input: UserInput = match serde_json::from_str(input) { Ok(parsed) => parsed, Err(_) => return Err("Invalid JSON format".to_string()), }; // Validate username if user_input.username.len() < 3 || user_input.username.len() > 50 { return Err("Username must be between 3 and 50 characters".to_string()); } // Validate age if user_input.age > 150 { return Err("Age cannot be greater than 150".to_string()); } println!("Username: {}, Age: {}", user_input.username, user_input.age); Ok(()) } fn main() { let input = r#"{"username": "valid_user", "age": 30}"#; match process_input(input) { Ok(_) => println!("Input processed successfully"), Err(e) => println!("Error: {}", e), } let invalid_input = r#"{"username": "a", "age": 200}"#; match process_input(invalid_input) { Ok(_) => println!("Input processed successfully"), Err(e) => println!("Error: {}", e), } } """ **Anti-Pattern:** Trusting input without explicit checks. """rust // Anti-pattern: No input validation fn process_unsafe_input(username: &str) { println!("Processing username: {}", username); } """ ### 1.2. Standard: Sanitize Input to Prevent Injection Attacks **Do This:** Sanitize input by encoding or escaping characters that have special meaning in the context where the data will be used (e.g., HTML, SQL, shell commands). **Don't Do This:** Directly concatenate unsanitized input into queries or commands. **Why:** Prevents injection vulnerabilities by ensuring that user-provided data is treated as data, not as executable code or commands. **Code Example: SQL Injection Prevention** Using "sqlx" for parameterization: """rust use sqlx::sqlite::{SqlitePoolOptions, SqlitePool}; use sqlx::query; async fn execute_query(pool: &SqlitePool, username: &str, email: &str) -> Result<(), sqlx::Error> { let result = query!( "INSERT INTO users (username, email) VALUES (?, ?)", username, email ) .execute(pool) .await?; println!("Rows affected: {}", result.rows_affected()); Ok(()) } #[tokio::main] async fn main() -> Result<(), sqlx::Error> { let pool = SqlitePoolOptions::new() .max_connections(5) .connect("sqlite::memory:") .await?; sqlx::migrate!("./migrations").run(&pool).await?; execute_query(&pool, "safe_user", "safe@example.com").await?; // Example usage with potentially unsafe input let username = "user'); DROP TABLE users;--"; let email = "unsafe@example.com"; // This is now safe because sqlx uses prepared statements, // escaping the special characters in the username execute_query(&pool, username, email).await?; Ok(()) } """ **Anti-Pattern:** String concatenation to build SQL queries. """rust // Anti-pattern: Vulnerable to SQL injection async fn execute_unsafe_query(pool: &SqlitePool, username: &str) -> Result<(), sqlx::Error> { let query_string = format!("SELECT * FROM users WHERE username = '{}'", username); let result = sqlx::query(&query_string) .execute(pool) .await?; println!("Rows affected: {}", result.rows_affected()); Ok(()) } """ ## 2. Memory Safety and Ownership ### 2.1. Standard: Leverage Ownership and Borrowing **Do This:** Fully utilize Rust's ownership and borrowing system to avoid common memory safety issues, such as dangling pointers, data races, and use-after-free bugs. **Don't Do This:** Disable or bypass the borrow checker without a very strong reason. Resort to "unsafe" code only when absolutely necessary and with extreme caution. **Why:** Rust's ownership and borrowing system enforces memory safety at compile time, eliminating many runtime errors endemic in other languages. **Code Example:** """rust fn process_string(data: String) { // 'data' is owned by this function println!("Processing: {}", data); } // 'data' is dropped here, memory is automatically managed fn main() { let my_string = "Hello, Rust!".to_string(); process_string(my_string); // Ownership transferred // my_string is no longer valid here // println!("{}", my_string); // This would cause a compile error } """ **Anti-Pattern:** Ignoring borrow checker errors or using "unsafe" blocks without a clear understanding of the implications. ### 2.2. Standard: Use Smart Pointers for Shared Ownership **Do This:** Use "Rc", "Arc", "RefCell", and "Mutex" appropriately when shared ownership or mutable access is required, ensuring that data is managed safely. Use "Arc" and "Mutex" for thread-safe shared mutable access. **Don't Do This:** Implement custom memory management solutions unless absolutely necessary. **Why:** Smart pointers provide safe and controlled ways to manage shared data and prevent memory leaks. **Code Example: Thread-Safe Shared Mutable State** """rust use std::sync::{Arc, Mutex}; use std::thread; fn main() { let counter = Arc::new(Mutex::new(0)); let mut handles = vec![]; for _ in 0..10 { let counter = Arc::clone(&counter); let handle = thread::spawn(move || { let mut num = counter.lock().unwrap(); *num += 1; }); handles.push(handle); } for handle in handles { handle.join().unwrap(); } println!("Result: {}", *counter.lock().unwrap()); } """ **Anti-Pattern:** Manually managing shared mutable state without proper synchronization, leading to data races. ### 2.3. Standard: Avoid "unsafe" Code When Possible **Do This:** Limit the use of "unsafe" blocks to situations where it is truly necessary (e.g., interacting with C libraries, low-level hardware access). When using "unsafe", provide detailed comments explaining why it is required and how safety is maintained. **Don't Do This:** Use "unsafe" as a workaround for borrow checker errors or performance issues without careful consideration. **Why:** "unsafe" code bypasses Rust's safety guarantees and requires manual verification of memory safety, which is error-prone. **Code Example: Interacting with a C Library** """rust use std::ffi::CString; use std::os::raw::c_char; extern "C" { fn c_function(input: *const c_char) -> i32; } fn call_c_function(input: &str) -> i32 { let c_string = CString::new(input).expect("CString::new failed"); let result = unsafe { c_function(c_string.as_ptr()) }; result } fn main() { let input = "Hello from Rust!"; let result = call_c_function(input); println!("Result from C function: {}", result); } """ **Anti-Pattern:** Using "unsafe" code without proper justification and documentation. For instance, using "unsafe" to perform unchecked array access when safe alternatives like ".get()" or ".get_mut()" are available. ## 3. Concurrency and Parallelism ### 3.1. Standard: Utilize RAII Principles in Concurrent Operations **Do This:** Apply Resource Acquisition Is Initialization (RAII) principles to manage locks and other resources in concurrent code. Use "MutexGuard" with "Mutex" and "RwLockWriteGuard" and "RwLockReadGuard" with "RwLock". **Don't Do This:** Manually acquire and release locks without RAII, which can lead to deadlocks or unlocked resources. **Why:** RAII ensures that resources are automatically released when they go out of scope, even in the presence of panics or errors. **Code Example:** """rust use std::sync::Mutex; struct SharedData { data: i32, mutex: Mutex<()>, } impl SharedData { fn new(data: i32) -> Self { SharedData { data, mutex: Mutex::new(()), } } fn modify_data(&mut self, value: i32) { let _guard = self.mutex.lock().unwrap(); // RAII: lock acquired self.data += value; // _guard is dropped here, releasing the lock } fn get_data(&self) -> i32 { let _guard = self.mutex.lock().unwrap(); // RAII: lock acquired self.data // _guard is dropped here, releasing the lock } } fn main() { let mut shared_data = SharedData::new(10); shared_data.modify_data(5); println!("Data: {}", shared_data.get_data()); } """ **Anti-Pattern:** Manually releasing locks can lead to errors if exceptions occur between the lock acquisition and release. """rust // Anti-Pattern use std::sync::{Mutex, PoisonError}; struct ManualLockExample { mutex: Mutex<i32>, } impl ManualLockExample { fn modify_data(&self) -> Result<(), PoisonError<std::sync::MutexGuard<'_, i32>>> { let lock_result = self.mutex.lock(); match lock_result { Ok(mut data) => { *data += 1; // Forget to unlock the mutex manually, leading to potential deadlocks // self.mutex.unlock(); // Hypothetical unlock function Ok(()) } Err(poisoned) => { // Handle potential lock poisoning Err(poisoned) } } } } fn main() {} """ ### 3.2. Standard: Use Message Passing for Concurrency **Do This:** Prefer message passing (using channels) for communication between threads, which reduces the risk of data races and deadlocks. **Don't Do This:** Rely solely on shared mutable state for concurrency, especially without proper synchronization primitives. **Why:** Message passing enforces data isolation and simplifies reasoning about concurrent code. **Code Example:** """rust use std::thread; use std::sync::mpsc::channel; fn main() { let (tx, rx) = channel(); thread::spawn(move || { let message = "Hello from thread!".to_string(); tx.send(message).unwrap(); }); let received = rx.recv().unwrap(); println!("Received: {}", received); } """ **Anti-Pattern:** Directly accessing and modifying shared variables from multiple threads without synchronization. ### 3.3. Standard: Avoid Deadlocks **Do This:** Design concurrent code to avoid deadlocks by ensuring consistent lock ordering or using timeouts when acquiring locks. **Don't Do This:** Acquire locks in different orders in different threads. **Why:** Deadlocks can halt program execution indefinitely and are difficult to debug. **Code Example: Avoiding Deadlock with Consistent Lock Ordering** """rust use std::sync::{Mutex, Arc}; use std::thread; struct Account { balance: i32, mutex: Mutex<()>, } fn transfer(from: Arc<Account>, to: Arc<Account>, amount: i32) { // Acquire locks in a consistent order (e.g., sort accounts by memory address) let (lock1, lock2) = if from.mutex.as_ptr() < to.mutex.as_ptr() { (from.mutex.lock().unwrap(), to.mutex.lock().unwrap()) } else { (to.mutex.lock().unwrap(), from.mutex.lock().unwrap()) }; if from.balance >= amount { from.balance -= amount; to.balance += amount; println!("Transfer successful: {} -> {}: {}", from.balance, to.balance, amount); } // Locks are released when "lock1" and "lock2" go out of scope. } fn main() { let account1 = Arc::new(Account { balance: 100, mutex: Mutex::new(()) }); let account2 = Arc::new(Account { balance: 50, mutex: Mutex::new(()) }); let account1_clone = Arc::clone(&account1); let account2_clone = Arc::clone(&account2); let handle1 = thread::spawn(move || { transfer(account1_clone, account2_clone, 20); }); let account3_clone = Arc::clone(&account1); let account4_clone = Arc::clone(&account2); let handle2 = thread::spawn(move || { transfer(account4_clone, account3_clone, 10); }); handle1.join().unwrap(); handle2.join().unwrap(); } """ **Anti-Pattern:** Acquiring locks in arbitrary order, leading to potential deadlocks. ## 4. Cryptography ### 4.1. Standard: Use Established Cryptographic Libraries **Do This:** Use well-vetted and established cryptographic libraries such as "ring", "RustCrypto crates" (e.g. "aes", "sha2"), or "sodiumoxide". **Don't Do This:** Implement custom cryptographic algorithms or primitives. **Why:** Cryptography is complex, and implementing it correctly is extremely challenging. Using established libraries ensures that you are using algorithms and implementations that have been thoroughly reviewed and tested. **Code Example: Hashing with SHA-256 using "sha2"** """rust use sha2::{Sha256, Digest}; fn main() { // Create a SHA256 hasher let mut hasher = Sha256::new(); // Process input message hasher.update(b"Hello, world!"); // Finalize the hash and get the resulting hash value let result = hasher.finalize(); println!("SHA256 Hash: {:x}", result); } """ **Anti-Pattern:** Rolling your own cryptographic functions or primitives, as this is very likely to lead to vulnerabilities. ### 4.2. Standard: Store Secrets Securely **Do This:** Use robust methods for storing secrets, such as encryption, hardware security modules (HSMs), or secure enclaves. Avoid storing secrets directly in configuration files or source code. Use environment variables sparingly. **Don't Do This:** Commit secrets directly in your code repositories or embed them in application packages. **Why:** Protects sensitive information from unauthorized access. **Code Example: Using Environment Variables for Sensitive Data** """rust use std::env; fn main() { // Retrieve API key from environment variable let api_key = match env::var("API_KEY") { Ok(key) => key, Err(_) => { eprintln!("Error: API_KEY environment variable not set."); std::process::exit(1); } }; println!("API Key: {}", api_key); // Use the API key securely in your application logic } """ **Anti-Pattern:** Hardcoding secrets and committing them into version control. ### 4.3. Standard: Handle Keys Safely **Do This:** Generate, store, and manage cryptographic keys securely. Use appropriate key lengths and algorithms based on security requirements. Protect keys from unauthorized access, modification, and disclosure. Rotate keys regularly. **Don't Do This:** Use weak or predictable keys, or store keys in insecure locations. **Why:** Compromised keys can lead to decryption of sensitive data, authentication bypass, and other security breaches. **Code Example: Key generation with ring** """rust use ring::rand::SystemRandom; use ring::signature; fn main() -> Result<(), Box<dyn std::error::Error>> { let rng = SystemRandom::new(); let pkcs8_bytes = signature::Ed25519KeyPair::generate_pkcs8(&rng)?; // Safely store or transmit pkcs8_bytes (e.g., encrypt it before storing) println!("Generated Ed25519 private key (PKCS#8 format): {:?}", pkcs8_bytes.expose()); Ok(()) } """ **Anti-Pattern:** Hardcoding or using weak keys leads to vulnerabilities. ## 5. Error Handling and Logging ### 5.1. Standard: Handle Errors Gracefully **Do This:** Implement proper error handling to prevent unexpected program termination and information leakage. Use "Result" and "Option" appropriately to handle potential errors. **Don't Do This:** Use "unwrap" or "expect" without a clear understanding of the potential for failure. **Why:** Unhandled errors can lead to application crashes and expose sensitive information. **Code Example:** """rust fn divide(a: i32, b: i32) -> Result<i32, String> { if b == 0 { return Err("Cannot divide by zero".to_string()); } Ok(a / b) } fn main() { match divide(10, 2) { Ok(result) => println!("Result: {}", result), Err(e) => println!("Error: {}", e), } match divide(10, 0) { Ok(result) => println!("Result: {}", result), Err(e) => println!("Error: {}", e), } } """ **Anti-Pattern:** Using "unwrap" directly without handling potential errors: leads to program crashes. ### 5.2. Standard: Log Security-Related Events **Do This:** Log important security-related events such as authentication attempts, authorization failures, and data modification operations. **Don't Do This:** Log sensitive information directly (e.g., passwords, API keys). Sanitize log data to prevent information leakage. **Why:** Logs provide valuable information for security monitoring, incident response, and auditing. **Code Example: Logging with "log" crate** """rust use log::{info, warn, error, LevelFilter}; use simplelog::{SimpleLogger, Config, WriteLogger, TerminalMode, ColorChoice}; use std::fs::File; fn initialize_logging() -> Result<(), Box<dyn std::error::Error>> { let config = Config::default(); // or a custom config // Log to terminal SimpleLogger::init(LevelFilter::Info, config.clone())?; // Log to file WriteLogger::init( LevelFilter::Info, Config::default(), File::create("my_application.log")?, )?; Ok(()) } fn main() -> Result<(), Box<dyn std::error::Error>> { initialize_logging()?; info!("Application started"); warn!("This is a warning message"); error!("This is an error message"); Ok(()) } """ **Anti-Pattern:** Logging sensitive data or not logging security relevant events makes debugging and security checks harder. ### 5.3. Standard: Prevent Information Leakage in Error Messages **Do This:** Avoid including sensitive information in error messages that could be exposed to unauthorized users. Use generic error messages or internal error codes. **Don't Do This:** Directly expose database error messages, file paths, or other sensitive data in error responses. **Why:** Prevents attackers from gaining information about your system or data. **Code Example:** """rust use std::io; fn read_file(path: &str) -> Result<String, String> { match std::fs::read_to_string(path) { Ok(content) => Ok(content), Err(_) => Err("Failed to read file".to_string()), // Generic error message } } fn main() { match read_file("sensitive_file.txt") { Ok(content) => println!("File content: {}", content), Err(e) => println!("Error: {}", e), // Generic error message } } """ **Anti-Pattern:** Exposing the file path or specific error via the result. ## 6. Dependencies and Supply Chain Security ### 6.1. Standard: Manage Dependencies Carefully **Do This:** Use Cargo to manage dependencies and specify version constraints in your "Cargo.toml" file. Review dependencies for security vulnerabilities. Consider using tools like "cargo audit" to scan your dependencies for known vulnerabilities. **Don't Do This:** Use dependencies from untrusted sources or without verifying their integrity. **Why:** Prevents the introduction of malicious code or known vulnerabilities into your application. **Code Example: Using Version Constraints and "cargo audit"** """toml # Cargo.toml [dependencies] rand = "0.8" # Specify a version or a version range log = "0.4" sha2 = "0.10" """ Run "cargo audit" in your project directory to check for known vulnerabilities in your dependencies. **Anti-Pattern:** Not being explicit about versions makes it harder to have reproductible installations. ### 6.2. Standard: Verify Dependency Integrity **Do This:** Use Cargo's features to verify the integrity of downloaded dependencies. Cargo automatically verifies checksums by default (defined in "Cargo.lock"). **Don't Do This:** Disable checksum verification or use dependencies without verifying their integrity. **Why:** Ensures that you are using the intended versions of dependencies and that they have not been tampered with. **Code Example:** "Cargo.lock" automatically generated """toml # Cargo.lock (example) [[package]] name = "rand" version = "0.8.5" checksum = "..." // automatically verified by cargo """ ### 6.3. Standard: Minimize Dependencies ("Dependency Hygiene") **Do This:** Regularly audit your project's dependencies and remove any that are no longer needed. Be mindful of "transitive dependencies" (dependencies of your dependencies). **Don't Do This:** Over-rely on dependencies or add dependencies without careful consideration. **Why:** Reducing the number of dependencies decreases the attack surface and reduces the risk of introducing vulnerabilities. ## 7. Web Application Security (If Applicable) ### 7.1. Standard: Protect Against Cross-Site Scripting (XSS) **Do This:** Properly escape or sanitize all user-provided data before including it in HTML output. Use templating engines that automatically provide XSS protection (e.g., Tera, Handlebars). Implement a Content Security Policy (CSP). **Don't Do This:** Directly insert unsanitized user input into HTML. **Why:** Prevents malicious scripts from being injected into your web pages and executed by users. **Code Example: Using Tera Template Engine with Escaping** """rust use tera::{Tera, Context}; use std::error::Error; fn render_template(username: &str) -> Result<String, Box<dyn Error>> { let tera = Tera::new("templates/**/*")?; // Load templates let mut context = Context::new(); context.insert("username", &username); tera.render("user_profile.html", &context).map_err(|e| e.into()) } fn main() -> Result<(), Box<dyn Error>> { // Create a template file "templates/user_profile.html": // <p>Welcome, {{ username }}!</p> (Tera automatically escapes "username") let username = "<script>alert('XSS');</script>"; let rendered_html = render_template(username)?; println!("Rendered HTML: {}", rendered_html); Ok(()) } """ Where our "templates/user_profile.html" file is: """html <!DOCTYPE html> <html> <head> <title>User Profile</title> </head> <body> <p>Welcome, {{ username }}!</p> </body> </html> """ **Anti-Pattern:** Directly embedding user input in HTML without escaping is a major XSS risk. ### 7.2. Standard: Protect Against Cross-Site Request Forgery (CSRF) **Do This:** Implement CSRF protection mechanisms such as anti-CSRF tokens or double-submit cookies. Check the "Origin" and "Referer" headers. **Don't Do This:** Rely solely on cookies for authentication without CSRF protection. **Why:** Prevents attackers from forcing users to perform actions against their will. ### 7.3. Standard: Implement Secure Authentication and Authorization **Do This:** Use strong authentication mechanisms such as multi-factor authentication (MFA). Implement proper authorization to control access to resources based on user roles and permissions. **Don't Do This:** Use weak or default credentials. **Why:** Protects sensitive resources and prevents unauthorized access. ## 8. Fuzzing ### 8.1 Standard: Implement Fuzzing **Do This:** Utilize fuzzing tools to find vulnerabilities and test security before release. Tools such as "cargo fuzz" are highly effective. **Don't Do This:** Assume your code is perfect, fuzzing helps uncover unexpected vulnerabilities. **Why:** Fuzzing is a great tool that attempts to provide unexpected input to trigger vulnerabilities **Code Example: Setting up cargo-fuzz** 1. Add "cargo-fuzz" """bash cargo install cargo-fuzz """ 2. Add to your rust project """bash cargo fuzz init """ 3. Add a fuzz target such as "fuzz_target!(|data: &[u8]| {" **Anti-Pattern:** Not fuzzing because you assume your code's perfect will often lead to undetected vulnerabilities. ## 9. Code Reviews ### 9.1 Standard: Regular Code Reviews **Do This:** Implement code reviews, leveraging checklists tailored to security issues, to uncover potential vulnerabilities before deployment. **Don't Do This:** Assume developers perfectly catch all vulnerabilities during solo development. **Why:** A fresh pair of eyes can catch misunderstandings or mistakes that a developer might overlook. **Code Example:** Implementations will vary based on team size, but an example can be: 1. Use automated systems like GitHub Actions to gate check-ins until code reviews occur 2. Have a senior member of the team approve sensitive modules. ## 10. Staying up-to-date with Rust Security ### 10.1 Standard: Continuous Learning **Do This:** Keep track of new releases and stay up to date, especially on security patches in released versions. **Don't Do This:** Assume language security stays constant forever. **Why:** Rust evolves over time and previously unknown vulnerabilities are identified. Staying up to date keeps you and your team protected. This document provides a comprehensive overview of security best practices for Rust development. By following these guidelines, developers can build more secure and resilient applications. Remember to adapt these standards to your specific project requirements and stay up-to-date with the latest security advisories and best practices.

DA

danielsoglCreated Mar 6, 2025

Core Architecture Standards for Rust

Rust

# Core Architecture Standards for Rust This document outlines the core architectural standards for Rust projects, focusing on project structure, common architectural patterns adapted for Rust's unique features, and organizational principles that promote maintainability, performance, and security. These standards are designed to be adopted by professional development teams and serve as a reference for both developers and AI coding assistants. ## 1. Project Structure and Organization A well-organized project structure is vital for long-term maintainability and collaboration. Rust's module system and crate ecosystem provide powerful tools for managing complexity. ### 1.1. Standard Project Layout * **Do This:** Adhere to the standard project layout promoted by Cargo. This includes, at a minimum: * "src/main.rs": The main entry point for executable binaries. * "src/lib.rs": The main entry point for library crates. * "src/": A directory containing all Rust source code. Organize modules and submodules within this directory. * "Cargo.toml": The project's manifest file outlining dependencies, metadata, and build configurations. * "Cargo.lock": A lockfile that specifies the exact versions of dependencies used in the project. Checked into version control. * "benches/": (Optional) Location for benchmark tests. * "examples/": (Optional) Demonstrations on how to use the crate. * "tests/": (Optional) Integration tests that treat the crate as an external dependency * **Don't Do This:** Place source code files directly in the root directory or scatter them across multiple locations. Avoid inconsistent naming conventions. * **Why:** Establishes a predictable and familiar structure for all Rust projects, making it easier for developers to navigate and understand foreign codebases. Cargo tooling relies on this structure. **Example:** """ my_project/ ├── Cargo.toml ├── Cargo.lock ├── src/ │ ├── main.rs # Executable entry point │ └── lib.rs # Library entry point ├── benches/ │ └── my_benchmark.rs ├── examples/ │ └── my_example.rs └── tests/ └── my_integration_test.rs """ ### 1.2. Module Hierarchy * **Do This:** Use Rust's module system to create a clear and logical hierarchy for your code. Group related functionalities into modules and submodules. * Organize modules based on functionality and responsibility. * Use "mod.rs" files to explicitly define submodules for easier navigation. * **Don't Do This:** Create excessively deep or flat module hierarchies. Avoid cyclic dependencies between modules. * **Why:** Improves code organization, reduces naming conflicts, and encourages code reuse. Helps with code discoverability. **Example:** """ src/ ├── lib.rs ├── api/ │ ├── mod.rs │ ├── models.rs │ └── controllers.rs └── utils/ ├── mod.rs └── logging.rs """ "src/lib.rs": """rust mod api; mod utils; pub use api::*; pub use utils::*; """ "src/api/mod.rs": """rust pub mod models; pub mod controllers; """ ### 1.3. Crate Organization * **Do This:** For larger projects, consider splitting the project into multiple crates. * Each crate should encapsulate a distinct, well-defined responsibility. * Use workspace to manage multiple crates within a single project. * Utilize feature flags to enable/disable parts of the crates. * **Don't Do This:** Create overly granular crates or giant monolithic crates. * **Why:** Improves build times, promotes code reuse across projects, and simplifies dependency management. **Example:** "Cargo.toml": """toml [workspace] members = [ "core", "api", "cli", ] """ "core/Cargo.toml": """toml [package] name = "my_project_core" version = "0.1.0" edition = "2021" """ "api/Cargo.toml": """toml [package] name = "my_project_api" version = "0.1.0" edition = "2021" [dependencies] my_project_core = { path = "../core" } """ ### 1.4. Naming Conventions * **Do This:** Follow established Rust naming conventions: * "snake_case" for variables, functions, and modules. * "PascalCase" for types (structs, enums, traits). * "SCREAMING_SNAKE_CASE" for constants and statics. * **Don't Do This:** Deviate from the standard naming conventions. Use abbreviations that are not widely understood. * **Why:** Increases code readability and maintainability by conforming to established patterns. **Example:** """rust mod user_management; // module name struct UserProfile; // struct name const MAX_USERS: u32 = 1000; // constant name fn calculate_average(numbers: &[f64]) -> f64 { // function and variable names // ... } """ ## 2. Architectural Patterns in Rust Rust's features necessitate adapting common architectural patterns to leverage its strengths and address its unique challenges. ### 2.1. Actor Model * **Do This:** Use the Actor Model for concurrent and distributed systems. * Utilize libraries like "tokio" and "async-std" for asynchronous execution. * Define actors as structs with message queues handled asynchronously. * Ensure actors communicate only by passing messages. * **Don't Do This:** Share mutable state directly between actors. Rely on unprotected global variables. * **Why:** Provides a safe and efficient way to manage concurrency, avoiding common pitfalls associated with shared mutable state. **Example:** """rust use tokio::sync::mpsc; use tokio::task; #[derive(Debug)] enum Message { Increment, GetCount(tokio::sync::oneshot::Sender<u32>), } struct CounterActor { count: u32, receiver: mpsc::Receiver<Message>, } impl CounterActor { fn new(receiver: mpsc::Receiver<Message>) -> Self { CounterActor { count: 0, receiver } } async fn run(&mut self) { while let Some(msg) = self.receiver.recv().await { match msg { Message::Increment => { self.count += 1; println!("Incremented count to {}", self.count); } Message::GetCount(tx) => { let _ = tx.send(self.count); } } } } } #[tokio::main] async fn main() { let (tx, rx) = mpsc::channel(32); let mut actor = CounterActor::new(rx); task::spawn(async move { actor.run().await; }); tx.send(Message::Increment).await.unwrap(); tx.send(Message::Increment).await.unwrap(); let (count_tx, count_rx) = tokio::sync::oneshot::channel(); tx.send(Message::GetCount(count_tx)).await.unwrap(); let count = count_rx.await.unwrap(); println!("Final count: {}", count); } """ ### 2.2. Microservices * **Do This:** Design microservices with clear boundaries and responsibilities. * Use lightweight communication protocols like REST or gRPC. * Embrace asynchronous communication when appropriate (e.g., message queues). * Implement robust error handling and monitoring. * Consider using frameworks like Actix-web or Tonic (gRPC) * **Don't Do This:** Create tightly coupled microservices that are difficult to deploy and maintain independently. * **Why:** Enables independent development and deployment, improves scalability and resilience. **Example (Actix-web):** """rust use actix_web::{web, App, HttpResponse, HttpServer, Responder}; async fn health_check() -> impl Responder { HttpResponse::Ok().body("Service is healthy") } #[actix_web::main] async fn main() -> std::io::Result<()> { HttpServer::new(|| { App::new() .route("/health", web::get().to(health_check)) }) .bind("127.0.0.1:8080")? .run() .await } """ ### 2.3. Event-Driven Architecture * **Do This:** Use an event-driven architecture for decoupled components and asynchronous processing. * Define clear event contracts (schemas). * Use message queues (e.g., RabbitMQ, Kafka) as event buses. * Handle events idempotently to avoid inconsistencies in case of failures. * **Don't Do This:** Create complex event chains that are difficult to trace and debug. * **Why:** Improves system responsiveness, scalability, and fault tolerance. Allows components to react to changes in the system without direct dependencies. ### 2.4. Clean Architecture * **Do This:** Structure your code using Clean Architecture principles to separate concerns. * Separate business logic from implementation details (frameworks, databases, UI). * Define entities, use cases, interface adapters and frameworks & drivers as distinct layers. * Dependencies should point inwards (towards business logic and entities). * Utilize Dependency Injection. * **Don't Do This:** Tie business logic directly to frameworks or database implementations. * **Why:** Promotes testability, maintainability, and adaptability. Facilitates changes to underlying technologies without affecting core business logic. ### 2.5 Data-Oriented Design (DOD) * **Do This:** Consider Data-Oriented Design (DOD) for performance-critical applications, especially in game development or high-performance computing. * Organize data in structures-of-arrays (SoA) rather than arrays-of-structures (AoS). This improves cache efficiency. * Process data in batches to reduce function call overhead. * Embrace the Entity Component System (ECS) pattern in appropriate contexts. * **Don't Do This:** Apply DOD blindly to all applications. It's most beneficial in situations where data access patterns are well-understood and performance is paramount. AoS is generally more ergonomic for smaller projects. * **Why:** Maximizes CPU cache utilization and minimizes memory access latency, leading to significant performance improvements in computationally intensive tasks. **Example (ECS):** """rust struct Position { x: f32, y: f32, } struct Velocity { dx: f32, dy: f32, } struct Entity(usize); // Simple entity ID fn main() { let mut positions: Vec<Option<Position>> = Vec::new(); let mut velocities: Vec<Option<Velocity>> = Vec::new(); let mut next_entity_id = 0; // Create entities let entity1 = create_entity(&mut positions, &mut velocities, &mut next_entity_id, Position { x: 0.0, y: 0.0 }, Velocity { dx: 1.0, dy: 0.5 }); let entity2 = create_entity(&mut positions, &mut velocities, &mut next_entity_id, Position { x: 5.0, y: 2.0 }, Velocity { dx: -0.5, dy: 0.0 }); // Movement system for i in 0..positions.len() { if let (Some(pos), Some(vel)) = (&mut positions[i], &velocities[i]) { pos.x += vel.dx; pos.y += vel.dy; println!("Entity {} moved to x: {}, y: {}", i, pos.x, pos.y); } } } fn create_entity( positions: &mut Vec<Option<Position>>, velocities: &mut Vec<Option<Velocity>>, next_entity_id: &mut usize, position: Position, velocity: Velocity, ) -> Entity { let entity_id = *next_entity_id; *next_entity_id += 1; if positions.len() <= entity_id { positions.resize_with(entity_id + 1, || None); } if velocities.len() <= entity_id { velocities.resize_with(entity_id + 1, || None); } positions[entity_id] = Some(position); velocities[entity_id] = Some(velocity); Entity(entity_id) } """ ## 3. Cross-Cutting Concerns These are concerns that impact many parts of the codebase and cannot be easily isolated within a single module. ### 3.1. Error Handling * **Do This:** Use "Result" for recoverable errors and "panic!" for unrecoverable errors. * Create a custom "Error" enum for your crate that implements "std::error::Error". * Use the "?" operator for propagating errors concisely. * Consider using the "thiserror" crate for deriving boilerplate code for error enums. * For library crates, avoid panicking unless absolutely necessary. * **Don't Do This:** Use "unwrap()" without a clear understanding of the potential for panics. Ignore errors. * **Why:** Ensures robust error handling, prevents unexpected program termination, and provides informative error messages. **Example:** """rust use std::fs::File; use std::io::{self, Read}; use thiserror::Error; #[derive(Error, Debug)] pub enum MyError { #[error("Failed to read file")] IoError(#[from] io::Error), #[error("Invalid data found")] InvalidData, } fn read_file_contents(path: &str) -> Result<String, MyError> { let mut file = File::open(path)?; let mut contents = String::new(); file.read_to_string(&mut contents)?; if contents.is_empty() { return Err(MyError::InvalidData); } Ok(contents) } fn main() { match read_file_contents("my_file.txt") { Ok(contents) => println!("File contents: {}", contents), Err(err) => eprintln!("Error: {}", err), } } """ ### 3.2. Logging * **Do This:** Use a logging framework like "log" and "env_logger" (or "tracing") for structured logging. * Configure logging levels appropriately (trace, debug, info, warn, error). * Include relevant context in log messages (timestamps, module names, user IDs). * Use structured logging to enable machine-readable logs for analysis and monitoring. "tracing" crate is better suited for that. * **Don't Do This:** Use "println!" for logging in production code. Ignore log messages. * **Why:** Provides valuable insights into application behavior, facilitates debugging and troubleshooting, and enables effective monitoring of production systems. **Example:** """rust use log::{info, warn, error, debug, trace}; fn main() { env_logger::init(); info!("Starting application"); let value = 42; debug!("The value is: {}", value); if value > 50 { warn!("Value is too high"); } else { trace!("Value is within acceptable range"); } if let Err(e) = some_fallible_operation() { error!("Operation failed: {}", e); } } fn some_fallible_operation() -> Result<(), String> { Err("Something went wrong".to_string()) } """ ### 3.3. Concurrency and Parallelism * **Do This:** Leverage Rust's ownership and borrowing system to write safe concurrent code. * Use "Mutex" and "RwLock" for protecting shared mutable state. * Use channels for communication between threads. * Consider using asynchronous programming with "tokio" or "async-std" for I/O-bound tasks. * Utilize parallel iterators with "rayon" for data-parallel computations. * **Don't Do This:** Use raw pointers for sharing mutable state without proper synchronization. Cause data races. * **Why:** Enables efficient use of multi-core processors, improves application responsiveness, and avoids common concurrency-related bugs. Rust's strong guarantees provide confidence in concurrent code. **Example (Rayon):** """rust use rayon::prelude::*; fn main() { let mut numbers: Vec<i32> = (0..100).collect(); numbers.par_iter_mut().for_each(|num| { *num *= 2; }); println!("{:?}", numbers); } """ ### 3.4. Security * **Do This:** Follow security best practices to prevent vulnerabilities. * Sanitize user inputs to prevent injection attacks. * Use secure cryptographic libraries like "ring" or "sodiumoxide". * Implement authentication and authorization mechanisms. * Be mindful of memory safety issues and avoid unsafe code whenever possible. Address any unsafe code with due diligence and extensive testing. * Use linters and static analysis tools (e.g., "cargo clippy") to identify potential security vulnerabilities. * Be aware of supply chain attacks using tools like "cargo audit" * **Don't Do This:** Store sensitive data in plaintext. Trust user inputs without validation. Ignore security warnings from compilers and linters. * **Why:** Protects application data and users from malicious attacks. Rust's memory safety features provide a strong foundation for building secure applications. ### 3.5 Performance * **Do This:** Write performance-conscious code from the beginning. * Choose appropriate data structures and algorithms for the task. * Avoid unnecessary allocations and copies. * Use profiling tools (e.g., "perf", "flamegraph") to identify performance bottlenecks. * Benchmark different implementations to find the most efficient solution. * Minimize the amount of unsafe code. * Consider using "#[inline]" to enable function inlining for performance-critical functions. * Use "cargo build --release" for optimized builds. * **Don't Do This:** Prematurely optimize code without profiling. Ignore performance regressions. * **Why:** Ensures that applications meet performance requirements and provide a good user experience. This document provides a comprehensive overview of core architecture standards for Rust projects. By adhering to these standards, development teams can build robust, maintainable, secure, and performant applications. Regular review and updates should be performed to keep the standard aligned with latest Rust developments and best practices.

DA

danielsoglCreated Mar 6, 2025

Component Design Standards for Rust

Rust

# Component Design Standards for Rust This document outlines the coding standards specifically focused on component design in Rust. These guidelines aim to promote the creation of reusable, maintainable, and performant components. Following these standards will lead to a more consistent and robust codebase. This document is designed to be used by developers and AI coding assistants alike. ## 1. Component Definition and Scope A component, in this context, refers to a self-contained, reusable module of code with a well-defined interface. It encapsulates specific functionality and minimizes external dependencies. The size and complexity of a component should be calibrated to the problem domain. ### 1.1. Single Responsibility Principle (SRP) * **Do This:** Ensure each component has one, and only one, reason to change. * **Don't Do This:** Create "god" components that handle multiple unrelated concerns because they become difficult to understand, test, and maintain. **Why:** SRP reduces coupling between different parts of the system. Changes to one component are less likely to affect other unrelated components. This improves maintainability and reduces the risk of introducing bugs during modifications. **Example:** """rust // Good: Separate components for data parsing and validation. mod data_parser { pub fn parse_data(data: &str) -> Result<Vec<String>, String> { // Implementation for parsing data. if data.is_empty() { return Err("Data is empty".to_string()); } Ok(data.split(',').map(|s| s.to_string()).collect()) } } mod data_validator { pub fn validate_data(data: &[String]) -> Result<(), String> { // Implementation for validating data. for item in data { if item.len() > 10 { return Err("Item too long".to_string()); } } Ok(()) } } fn main() { let data = "item1,item2,item3"; match data_parser::parse_data(data) { Ok(parsed_data) => { match data_validator::validate_data(&parsed_data) { Ok(_) => println!("Data is valid"), Err(err) => println!("Validation error: {}", err), } } Err(err) => println!("Parsing error: {}", err), } } """ """rust // Bad: Single component handling both parsing and validation. mod data_processor { pub fn process_data(data: &str) -> Result<Vec<String>, String> { // Parsing logic let parsed_data = data.split(',').map(|s| s.to_string()).collect::<Vec<_>>(); // Validation logic for item in &parsed_data { if item.len() > 10 { return Err("Item too long".to_string()); } } Ok(parsed_data) } } fn main() { let data = "item1,item2,item3"; match data_processor::process_data(data) { Ok(processed_data) => println!("Data is valid"), Err(err) => println!("Error: {}", err), } } """ ### 1.2. Clear Boundaries and Interfaces * **Do This:** Define clear, stable interfaces for your components using traits and public types. Leverage Rust's ownership system to ensure data integrity across component boundaries. * **Don't Do This:** Expose internal implementation details directly through public APIs. Avoid relying on global mutable state. **Why:** Well-defined interfaces abstract away the internal workings of a component, allowing you to change the implementation without affecting other parts of the system. The ownership system prevents data races and memory safety issues when passing data between components. **Example:** """rust // Good: Using traits to define a clear interface. pub trait ReportGenerator { fn generate_report(&self, data: &[String]) -> String; } pub struct HtmlReportGenerator; impl HtmlReportGenerator { pub fn new() -> HtmlReportGenerator { HtmlReportGenerator{} } } impl ReportGenerator for HtmlReportGenerator { fn generate_report(&self, data: &[String]) -> String { // Implementation for generating HTML reports. format!("<h1>Report:</h1><ul>{}</ul>", data.iter().map(|d| format!("<li>{}</li>", d)).collect::<String>()) } } // Bad: Directly exposing internal implementation details. // (Assume HtmlReportGenerator's fields were public and directly manipulated) """ ### 1.3. Explicit Dependencies * **Do This:** Explicitly declare all dependencies of a component, for example, using constructor injection. * **Don't Do This:** Rely on implicit dependencies or global state. **Why:** Explicit dependencies make it clear what a component needs to function correctly. This simplifies testing, as you can easily provide mock dependencies. It also reduces the risk of hidden dependencies causing unexpected behavior. **Example:** """rust // Good: Explicit dependency injection. struct DataProcessor<R: ReportGenerator> { report_generator: R, } impl<R: ReportGenerator> DataProcessor<R> { pub fn new(report_generator: R) -> DataProcessor<R> { DataProcessor { report_generator } } pub fn process_and_report(&self, data: &[String]) -> String { // Processing data. let processed_data = data.iter().map(|s| s.to_uppercase()).collect::<Vec<_>>(); self.report_generator.generate_report(&processed_data) } } fn main() { let html_report_generator = HtmlReportGenerator::new(); let data_processor = DataProcessor::new(html_report_generator); let data = vec!["item1".to_string(), "item2".to_string()]; let report = data_processor.process_and_report(&data); println!("{}", report); } """ ## 2. Modularity within Components Even within a relatively small component, maintain a modular approach to internal design. ### 2.1. Internal Sub-Modules * **Do This:** Divide the component into internal sub-modules for related functionality, utilizing "mod" keyword. Use visibility modifiers ("pub", "pub(crate)", "pub(super)") to control access to internal parts of components to implement information hiding. * **Don't Do This:** Lump everything into a single large module with no clear structure. **Why:** Improves readability, manageability, and helps encapsulate implementation details. Code is easier to navigate and understand, and refactoring becomes less risky. **Example:** """rust // Good: Using sub-modules for different aspects of a database connection. mod database { pub mod connection { pub fn connect(url: &str) -> Result<(), String> { // Implementation for connecting to the database. if url.is_empty() { return Err("URL is empty".to_string()); } Ok(()) } } pub mod query { pub fn execute_query(query: &str) -> Result<Vec<String>, String> { // Implementation for executing a database query. if query.is_empty() { return Err("Query is empty".to_string()); } Ok(vec!["result1".to_string(), "result2".to_string()]) } } pub mod transaction { pub fn start_transaction() { // Implementation for starting transaction } pub fn commit_transaction() { // Implementation for committing transaction } pub fn rollback_transaction() { // Implementation for rolling back transaction } } } fn main() { match database::connection::connect("example.com") { Ok(_) => println!("Connected to database"), Err(err) => println!("Error: {}", err), } match database::query::execute_query("SELECT * FROM users") { Ok(results) => println!("Results: {:?}", results), Err(err) => println!("Error: {}", err), } } """ ### 2.2. Internal Traits for Abstraction * **Do This:** Use traits internally within a component to abstract different implementations of the same functionality. * **Don't Do This:** Directly use concrete types everywhere, making it harder to change implementations later. **Why:** Allows swapping out different algorithms or strategies without modifying the rest of the component. This improves flexibility and testability. **Example:** """rust // Good: Internal trait for different data serialization formats. mod serializer { trait DataSerializer { fn serialize(&self, data: &[String]) -> Result<String, String>; } struct JsonSerializer; impl DataSerializer for JsonSerializer { fn serialize(&self, data: &[String]) -> Result<String, String> { // Implementation for serializing to JSON Ok(format!("{{\"data\": {:?}}}", data)) } } struct CsvSerializer; impl DataSerializer for CsvSerializer { fn serialize(&self, data: &[String]) -> Result<String, String> { //Implementation for serializing to CSV Ok(data.join(",")) } } pub struct DataConverter<T: DataSerializer> { serializer: T, } impl <T: DataSerializer> DataConverter<T> { pub fn new(serializer: T) -> DataConverter<T> { DataConverter { serializer } } pub fn convert_data(&self, data: &[String]) -> Result<String, String> { self.serializer.serialize(data) } } } fn main() { let data = vec!["item1".to_string(), "item2".to_string()]; let json_serializer = serializer::JsonSerializer{}; let data_converter = serializer::DataConverter::new( json_serializer); match data_converter.convert_data(&data) { Ok(serialized_data) => println!("Serialized data: {}", serialized_data), Err(err) => println!("Error: {}", err), } let csv_serializer = serializer::CsvSerializer{}; let data_converter_csv = serializer::DataConverter::new( csv_serializer); match data_converter_csv.convert_data(&data) { Ok(serialized_data) => println!("Serialized data: {}", serialized_data), Err(err) => println!("Error: {}", err), } } """ ## 3. Communication Between Components Establish clear patterns for how components interact with each other. ### 3.1. Message Passing * **Do This:** Favor asynchronous message passing (using channels from the "std::sync::mpsc" or "tokio::sync" modules or actors from crates like "actix") for communication between independent components, especially when concurrency is involved. * **Don't Do This:** Directly share mutable state between components without proper synchronization, leading to data races. **Why:** Message passing simplifies reasoning about concurrent code. It promotes loose coupling between components, as they don't need to know about each other's internal state. **Example:** """rust // Using channels for message passing. use std::sync::mpsc; use std::thread; fn main() { let (tx, rx) = mpsc::channel(); thread::spawn(move || { tx.send("Message from thread!".to_string()).unwrap(); }); match rx.recv() { Ok(message) => println!("Received: {}", message), Err(err) => println!("Error: {}", err), } } """ ### 3.2. Data Transfer Objects (DTOs) * **Do This:** When transferring data between components, especially across process boundaries, use Data Transfer Objects (DTOs) - simple data structures without behavior (structs with public fields, or enums). Consider using serialization libraries like "serde" for converting DTOs to and from different formats. * **Don't Do This:** Pass complex objects with associated behavior directly between components; this can lead to tight coupling. **Why:** DTOs decouple the data representation from the component's internal implementation, making it easier to evolve components independently. **Example:** """rust // Using DTOs and serde for serialization. use serde::{Deserialize, Serialize}; #[derive(Serialize, Deserialize, Debug)] pub struct User { pub id: u32, pub name: String, pub email: String, } fn main() { let user = User { id: 1, name: "John Doe".to_string(), email: "john.doe@example.com".to_string(), }; let serialized = serde_json::to_string(&user).unwrap(); println!("Serialized: {}", serialized); let deserialized: User = serde_json::from_str(&serialized).unwrap(); println!("Deserialized: {:?}", deserialized); } """ ## 4. Error Handling Robust error handling is essential for building reliable components. ### 4.1. Explicit Error Types * **Do This:** Define custom error types (using enums) that clearly describe the possible errors a component can return, and implement "std::error::Error" for those types. * **Don't Do This:** Return generic error types like "Box<dyn Error>" or strings, which provide little context about the error. Prefer using "thiserror" to derive the boilerplate. **Why:** Explicit error types make it easier for callers to handle errors correctly. They provide more information about the cause of the error, allowing for more specific error handling strategies. **Example:** """rust // Good: Custom error type using an enum. use thiserror::Error; #[derive(Debug, Error)] pub enum DatabaseError { #[error("Connection error: {0}")] ConnectionError(String), #[error("Query error: {0}")] QueryError(String), #[error("Transaction error: {0}")] TransactionError(String), } fn connect_to_database(url: &str) -> Result<(), DatabaseError> { if url.is_empty() { return Err(DatabaseError::ConnectionError("URL is empty".to_string())); } Ok(()) } fn main() { match connect_to_database("") { Ok(_) => println!("Connected to database"), Err(err) => println!("Error: {}", err), } } """ ### 4.2. Returning "Result" * **Do This:** Use "Result<T, E>" extensively in your component APIs to indicate the possibility of failure. This forces the caller to explicitly handle errors. * **Don't Do This:** Use "panic!" for recoverable errors. "panic!" should only be used for truly unrecoverable situations, like memory corruption or invalid program state. **Why:** "Result" makes error handling explicit and prevents errors from being silently ignored. ### 4.3. "?" Operator for Error Propagation * **Do This:** Use the "?" operator (or the "try" macro in older Rust versions) to propagate errors up the call stack. * **Don't Do This:** Manually match on "Result" and return early in every function; this is repetitive and error-prone. **Why:** The "?" operator simplifies error handling code, making it more readable and maintainable. **Example:** """rust // Using the "?" operator for concise error propagation. use thiserror::Error; #[derive(Debug, Error)] pub enum FileError { #[error("File not found: {0}")] NotFound(String), #[error("IO error: {0}")] IoError(#[from] std::io::Error), } fn read_file(path: &str) -> Result<String, FileError> { let contents = std::fs::read_to_string(path)?; Ok(contents) } fn process_file(path: &str) -> Result<(), FileError> { let contents = read_file(path)?; println!("File contents: {}", contents); Ok(()) } fn main() { match process_file("nonexistent_file.txt") { Ok(_) => println!("File processed successfully"), Err(err) => println!("Error: {}", err), } } """ ## 5. Testing Comprehensive testing is essential for ensuring component reliability. ### 5.1. Unit Tests * **Do This:** Write thorough unit tests for each component, covering all important functionality and edge cases. * **Don't Do This:** Neglect unit tests, assuming that the code "just works." **Why:** Unit tests provide confidence in the correctness of your code. They catch bugs early in the development process, before they can cause problems in production. ### 5.2. Integration Tests * **Do This:** Write integration tests to verify that different components work together correctly. These tests should simulate real-world scenarios. * **Don't Do This:** Only rely on unit tests, which may not catch integration problems. **Why:** Integration tests ensure that the system as a whole works as expected. They can uncover unexpected interactions between components. ### 5.3. Mocking * **Do This:** Use mocking frameworks (like "mockall") to isolate components during testing. This allows you to test components in isolation, without relying on external dependencies. * **Don't Do This:** Directly use real dependencies in unit tests, which can make the tests slow and brittle. **Why:** Mocking makes unit tests faster, more reliable, and easier to write. **Example:** """rust // Using mockall for mocking. #[cfg(test)] use mockall::{mock, predicate::*}; pub trait ExternalService { fn get_data(&self) -> Result<String, String>; } mock! { pub ExternalService {} impl ExternalService for ExternalService { fn get_data(&self) -> Result<String, String>; } } pub struct DataProcessor { service: Box<dyn ExternalService>, } impl DataProcessor { pub fn new(service: Box<dyn ExternalService>) -> DataProcessor { DataProcessor { service } } pub fn process_data(&self) -> Result<String, String> { let data = self.service.get_data()?; Ok(format!("Processed: {}", data)) } } #[cfg(test)] mod tests { use super::*; #[test] fn test_process_data() { let mut mock_service = MockExternalService::new(); mock_service.expect_get_data() .return_once(move || Ok("Some data".to_string())); let processor = DataProcessor::new(Box::new(mock_service)); let result = processor.process_data().unwrap(); assert_eq!(result, "Processed: Some data"); } } """ ## 6. Documentation Clear and comprehensive documentation is critical for maintainability. ### 6.1. Doc Comments * **Do This:** Write doc comments (using "///" for items and "//!" for modules/crates) to explain the purpose and usage of each component, function, and data structure. Include examples where possible. * **Don't Do This:** Neglect documentation, assuming that the code is self-explanatory. **Why:** Documentation makes it easier for other developers (and your future self) to understand and use your code. The Rust ecosystem relies heavily on autogenerated documentation, so clear doc comments are essential. ### 6.2. README Files * **Do This:** Include a README file for each component (or crate) that provides a high-level overview of its purpose, usage, and dependencies. * **Don't Do This:** Only provide code without any context or explanations. **Why:** README files provide a starting point for understanding a component. They should answer the question, "What does this do and how do I use it?" ### 6.3. Documenting Breaking Changes * **Do This:** Clearly document any breaking changes in new releases, including migration instructions. * **Don't Do This:** Introduce breaking changes without any notice or guidance. **Why:** Clear documentation of breaking changes minimizes disruption and makes it easier for users to upgrade to new versions of your components. This can be achieved via release notes. These standards provide a solid foundation for building robust and maintainable components in Rust. Following these guidelines will result in a cleaner, more reliable, and easier-to-understand codebase. Remember to adapt these standards to the specific needs of your project, while always prioritizing clarity, consistency, and code quality.

DA

danielsoglCreated Mar 6, 2025

State Management Standards for Rust

Rust

# State Management Standards for Rust This document outlines the recommended coding standards for state management in Rust. It aims to provide guidance on how to structure applications, manage data flow, and implement reactive patterns effectively using Rust's unique features and ecosystem. ## 1. Introduction to State Management in Rust State management is a critical aspect of application development, especially in Rust, where ownership and borrowing rules necessitate careful consideration. Effective state management ensures data consistency, predictability, and performance. This document focuses on best practices for managing application state, data flow, and reactivity within the context of Rust's ownership and borrowing system. ### 1.1. Why is State Management Crucial? * **Maintainability:** Well-structured state allows for easier modification and debugging. * **Predictability:** Clear state management reduces unexpected behavior and improves code reliability. * **Performance:** Efficient state handling prevents unnecessary data copying and contention. * **Security:** Correctly managing state prevents data races and other memory-related vulnerabilities. ### 1.2. Rust's Unique Challenges Rust's ownership, borrowing, and lifetime rules introduce specific challenges for state management but also provide powerful tools for building robust applications. Understanding these concepts is crucial for effective state management. * **Ownership and Borrowing:** Managing data ownership and borrowing to avoid data races and dangling pointers. * **Immutability:** Leveraging immutability by default to reduce state mutation complexity. * **Concurrency:** Using Rust's concurrency primitives safely and efficiently. ## 2. Architectural Patterns for State Management Choosing the right architectural pattern is fundamental to structuring your application's state. Several common patterns can be adapted for Rust. ### 2.1. Ownership-Based Architectures This is more of a fundamental concept than an architecture but it greatly influences architectural decisions * **Standard:** Employ Rust's ownership model to define data boundaries between components. Ensure each piece of data has a clear owner and that borrowing is used to share data without duplicating it. * **Do This:** * Clearly define ownership and borrowing relationships. * Consider using smart pointers ("Rc", "Arc", "RefCell", "Mutex") to manage shared state where necessary. * **Don't Do This:** * Rely on global mutable state without clear ownership. * Ignore the borrow checker's warnings; they often indicate potential state management issues. * **Why:** Rust's ownership model prevents data races and dangling pointers at compile time, making the code more robust and less prone to errors related to state. """rust // Example of ownership and borrowing struct Data { value: i32, } fn process_data(data: &Data) { println!("Processing data: {}", data.value); } fn main() { let data = Data { value: 42 }; process_data(&data); // Borrowing data immutably // data is still valid here because it was borrowed, not moved println!("Data value: {}", data.value); } """ ### 2.2. Entity-Component-System (ECS) ECS is a popular pattern for game development and other data-oriented applications. * **Standard:** Implement ECS with separate data structures for entities, components, and systems. Use an efficient storage mechanism for components. * **Do This:** * Consider using a crate like "specs" or "bevy" for ECS implementation. * Design components as simple data structures without behavior. * Implement logic in systems that operate on entities with specific components. * **Don't Do This:** * Store data that logically belongs in a component in the entity itself. * Create complex dependencies between systems. * **Why:** ECS separates data (components) from logic (systems), making it easier to reason about and modify application behavior. It promotes data locality and can improve performance. """rust // Example using the specs crate use specs::prelude::*; #[derive(Component, Debug)] #[storage(VecStorage)] struct Position { x: f32, y: f32, } #[derive(Component, Debug)] #[storage(VecStorage)] struct Velocity { x: f32, y: f32, } struct MovementSystem; impl<'a> System<'a> for MovementSystem { type SystemData = ( WriteStorage<'a, Position>, ReadStorage<'a, Velocity>, ); fn run(&mut self, (mut pos, vel): Self::SystemData) { for (pos, vel) in (&mut pos, &vel).join() { pos.x += vel.x; pos.y += vel.y; } } } fn main() { let mut world = World::new(); world.register::<Position>(); world.register::<Velocity>(); world.create_entity() .with(Position { x: 0.0, y: 0.0 }) .with(Velocity { x: 1.0, y: 0.5 }) .build(); let mut dispatcher = DispatcherBuilder::new() .with(MovementSystem, "movement", &[]) .build(); dispatcher.dispatch(&mut world); world.maintain(); for pos in world.read_storage::<Position>().join() { println!("Position: {:?}", pos); } } """ ### 2.3. Model-View-Controller (MVC) Classic pattern where the Model holds the application state, the View displays data, and the Controller handles user input and updates the Model. * **Standard:** Separate concerns into Model, View, and Controller. Use immutable data flow where possible. * **Do This:** * Use a framework like "Yew" (for web) or implement a custom MVC structure. * Model should be the single source of truth; View should only display data from the Model; Controller should update the Model based on user input. * Use a reactive framework to update the View when the Model changes. * **Don't Do This:** * Directly manipulate the DOM (in web applications) from the Model. * Put business logic in the View. * **Why:** MVC promotes a clear separation of concerns, making the application easier to understand, test, and maintain. """rust // Simplified MVC example (Yew) use yew::prelude::*; struct Model { value: i32, } #[function_component(MyComponent)] fn my_component() -> Html { let model = use_state(|| Model { value: 0 }); let increment = { let model = model.clone(); Callback::from(move |_| { model.set(Model { value: model.value + 1 }); }) }; html! { <div> <p>{ "Value: " }{ model.value }</p> <button onclick={increment}>{ "Increment" }</button> </div> } } fn main() { yew::Renderer::<MyComponent>::new().render(); } """ ### 2.4. Redux-like Patterns Inspired by Redux, this approach uses a single central state container, immutable state updates, and actions. * **Standard:** Implement a central store, actions to modify state, and reducers to apply actions. * **Do This:** * Use a crate like "reduxrs" or implement a custom Redux-like pattern. * Ensure state updates are immutable. * Use middleware for side effects and logging. * **Don't Do This:** * Mutate the state directly. * Perform long-running or blocking operations in reducers. * **Why:** Redux provides a predictable and debuggable state management solution, particularly useful in complex applications with many interacting components. """rust // Example using a custom Redux-like pattern use std::sync::{Arc, Mutex}; #[derive(Clone, Debug)] struct State { count: i32, } #[derive(Debug)] enum Action { Increment, Decrement, } type Reducer = fn(&State, &Action) -> State; fn reducer(state: &State, action: &Action) -> State { match action { Action::Increment => State { count: state.count + 1 }, Action::Decrement => State { count: state.count - 1 }, } } struct Store { state: Arc<Mutex<State>>, reducer: Reducer, } impl Store { fn new(reducer: Reducer, initial_state: State) -> Self { Store { state: Arc::new(Mutex::new(initial_state)), reducer, } } fn dispatch(&self, action: &Action) { let mut state = self.state.lock().unwrap(); *state = (self.reducer)(&state, action); } fn get_state(&self) -> State { self.state.lock().unwrap().clone() } } fn main() { let store = Store::new(reducer, State { count: 0 }); store.dispatch(&Action::Increment); println!("State: {:?}", store.get_state()); store.dispatch(&Action::Decrement); println!("State: {:?}", store.get_state()); } """ ## 3. Data Flow Managing data flow is crucial for ensuring data consistency and preventing race conditions. ### 3.1. Immutable Data Flow * **Standard:** Prefer immutable data flow whenever possible. * **Do This:** * Use immutable data structures and functions that return new values instead of modifying existing ones. * Consider using libraries like "im" or "rpds" for immutable data structures. * **Don't Do This:** * Mutate data directly unless absolutely necessary. * **Why:** Immutable data flow simplifies reasoning about program state and prevents unexpected side effects, leading to more robust and maintainable code. """rust // Example of immutable data flow fn increment(value: i32) -> i32 { value + 1 // Returns a new value instead of modifying 'value' } fn main() { let x = 5; let y = increment(x); println!("x: {}, y: {}", x, y); // x remains 5, y is 6 } """ ### 3.2. Reactive Data Flow * **Standard:** Use reactive programming principles to automatically propagate data changes. * **Do This:** * Consider using a reactive framework like "RxRs" or "futures" combined with channels. * Define data streams that emit events when data changes. * Use operators to transform and combine data streams. * **Don't Do This:** * Manually update dependent data whenever data changes. * **Why:** Reactive data flow simplifies complex data dependencies and ensures that changes are automatically propagated, reducing the risk of inconsistencies and improving performance. """rust // Simplified Reactive example using mpsc channels use std::sync::mpsc::{channel, Sender, Receiver}; use std::thread; fn main() { let (tx, rx): (Sender<i32>, Receiver<i32>) = channel(); thread::spawn(move || { for i in 1..=5 { println!("Sending: {}", i); tx.send(i).unwrap(); thread::sleep(std::time::Duration::from_millis(500)); } }); for received in rx { println!("Received: {}", received); } } """ ### 3.3. Error Handling for Data Flow * **Standard:** Handle errors gracefully during data flow processes to prevent data corruption and ensure application stability. * **Do This:** * Use "Result" to propagate errors. * Implement fallback mechanisms or retry logic where appropriate. * Log errors for debugging and monitoring. * **Don't Do This:** * Ignore errors or panic unexpectedly. * **Why:** Proper error handling is essential for building reliable applications that can gracefully recover from failures. """rust // Example of error handling with Result in data flow fn divide(x: i32, y: i32) -> Result<i32, String> { if y == 0 { Err("Cannot divide by zero".to_string()) } else { Ok(x / y) } } fn main() { match divide(10, 2) { Ok(result) => println!("Result: {}", result), Err(err) => println!("Error: {}", err), } match divide(5, 0) { Ok(result) => println!("Result: {}", result), Err(err) => println!("Error: {}", err), } } """ ## 4. Concurrency and Shared State Rust's ownership model is designed to prevent data races. However, shared mutable state still requires careful management. ### 4.1. Mutexes and Arc * **Standard:** Use "Mutex" and "Arc" for sharing mutable state between threads. * **Do This:** * Wrap shared data in "Arc<Mutex<T>>". * Acquire the lock before accessing or modifying shared state. * Release the lock as soon as possible to minimize contention. * **Don't Do This:** * Hold locks for extended periods. * Acquire multiple locks in an inconsistent order to avoid deadlocks. * **Why:** "Mutex" provides exclusive access to shared data, preventing data races. "Arc" allows sharing ownership of the "Mutex" across multiple threads. """rust // Example of using Mutex and Arc for shared state use std::sync::{Arc, Mutex}; use std::thread; fn main() { let counter = Arc::new(Mutex::new(0)); let mut handles = vec![]; for _ in 0..10 { let counter = Arc::clone(&counter); let handle = thread::spawn(move || { let mut num = counter.lock().unwrap(); *num += 1; }); handles.push(handle); } for handle in handles { handle.join().unwrap(); } println!("Result: {}", *counter.lock().unwrap()); } """ ### 4.2. Channels * **Standard:** Use channels for message passing between threads. * **Do This:** * Use "mpsc" (multiple producer, single consumer) or "broadcast" channels. * Send data through channels instead of sharing mutable state directly. * **Don't Do This:** * Share raw pointers or references across threads. * **Why:** Channels provide a safe and efficient way to communicate between threads without the need for explicit locking. """rust // Example of using channels for message passing use std::sync::mpsc; use std::thread; fn main() { let (tx, rx) = mpsc::channel(); thread::spawn(move || { tx.send(String::from("hello")).unwrap(); }); let received = rx.recv().unwrap(); println!("Got: {}", received); } """ ### 4.3. Atomic Types * **Standard:** Use "AtomicBool", "AtomicI32", etc., for simple atomic operations. * **Do This:** * Use atomic types for simple counters or flags. * Use atomic operations ("load", "store", "compare_and_swap") for thread-safe access. * **Don't Do This:** * Use atomic types for complex data structures. * **Why:** Atomic types provide low-level, lock-free concurrency primitives for simple operations. """rust // Example of using AtomicI32 use std::sync::atomic::{AtomicI32, Ordering}; use std::sync::Arc; use std::thread; fn main() { let atomic_counter = Arc::new(AtomicI32::new(0)); let mut handles = vec![]; for _ in 0..10 { let counter = Arc::clone(&atomic_counter); let handle = thread::spawn(move || { counter.fetch_add(1, Ordering::SeqCst); }); handles.push(handle); } for handle in handles { handle.join().unwrap(); } println!("Result: {}", atomic_counter.load(Ordering::SeqCst)); } """ ## 5. Reactivity Rust provides several options for implementing reactive patterns. ### 5.1. Futures and Async/Await * **Standard:** Use "async/await" for asynchronous programming and reactive data flow. * **Do This:** * Use "async" functions and blocks for non-blocking operations. * Use "await" to yield control and wait for the result of a future. * **Don't Do This:** * Block the main thread in "async" functions. * **Why:** "async/await" simplifies asynchronous programming and makes it easier to write non-blocking code. """rust // Example of using async/await async fn fetch_data() -> Result<String, String> { // Simulate fetching data asynchronously tokio::time::sleep(std::time::Duration::from_secs(1)).await; Ok("Data fetched successfully".to_string()) } #[tokio::main] async fn main() -> Result<(), String> { let result = fetch_data().await?; println!("{}", result); Ok(()) } """ ### 5.2. Streams * **Standard:** Use streams for handling sequences of asynchronous events. * **Do This:** * Use "futures::stream::Stream" or "tokio::stream::Stream". * Use combinators like "map", "filter", and "for_each" to process stream elements. * **Don't Do This:** * Block the stream processing logic. * **Why:** Streams provide a powerful way to handle sequences of asynchronous events and implement reactive pipelines. """rust // Example of using streams use futures::stream::{self, StreamExt}; #[tokio::main] async fn main() { let numbers = stream::iter(1..=5); numbers .map(|x| x * 2) .for_each(|x| async move { println!("Value: {}", x); }) .await; } """ ## 6. Libraries and Crates Leverage available Rust crates to simplify state management. ### 6.1. State Management Crates * "reduxrs": Implementation of the Redux pattern. * "specs": High-performance ECS library. * "bevy": Data-driven game engine built on ECS. * "im": Immutable data structures. * "rpds": Another set of persistent data structures. ### 6.2. Concurrency Crates * "tokio": Asynchronous runtime. * "async-std": Another asynchronous runtime. * "crossbeam": Concurrency primitives. ### 6.3. Reactivity Crates * "rxrs": Reactive Extensions for Rust. * "futures": Fundamental building blocks for asynchronous and concurrent programming. ## 7. Testing Testing is crucial to ensure the correctness of state management logic. ### 7.1. Unit Tests * **Standard:** Write unit tests to verify individual components and functions. * **Do This:** * Test state transitions and data flow. * Use mock objects or test doubles to isolate components. * **Don't Do This:** * Skip testing complex state management logic. * **Why:** Unit tests provide a quick and easy way to verify the correctness of individual components. """rust // Example of unit testing #[cfg(test)] mod tests { use super::*; #[test] fn test_increment() { let x = 5; let y = increment(x); assert_eq!(y, 6); } } """ ### 7.2. Integration Tests * **Standard:** Write integration tests to verify the interaction between multiple components. * **Do This:** * Test end-to-end data flows and state transitions. * Use realistic test data. * **Don't Do This:** * Rely solely on unit tests for complex state management logic. * **Why:** Integration tests ensure that components work together correctly and that the overall system behaves as expected. ## 8. Conclusion Effective state management is essential for building robust, maintainable, and performant Rust applications. By following these coding standards, developers can leverage Rust's unique features and ecosystem to manage state effectively, prevent data races, and build reliable systems. This includes selecting appropriate architectural patterns, managing data flow, handling concurrency, and implementing reactive patterns. Properly testing the state management logic is helpful in solidifying appropriate functionality. Regular review and adherence to these standards support high-quality Rust code.

DA

danielsoglCreated Mar 6, 2025

Performance Optimization Standards for Rust

Rust

# Performance Optimization Standards for Rust This document outlines coding standards specifically focused on performance optimization in Rust. It aims to guide developers in writing efficient, responsive, and resource-conscious applications. These standards are designed to be used in conjunction with other Rust coding guidelines covering style, safety, and maintainability. ## 1. Data Structures and Algorithms ### 1.1 Choose Appropriate Data Structures **Standard:** Select data structures based on the expected access patterns and performance characteristics. * **Do This:** Use "Vec" for contiguous storage and frequent iteration. Use "HashMap" or "BTreeMap" for key-value lookups, considering "HashMap" for speed and "BTreeMap" for sorted keys. Use "HashSet" for tracking unique values. * **Don't Do This:** Default to "Vec" without considering alternatives. Use "Vec" for frequent insertions/deletions in the middle. **Why:** Efficient data structures significantly impact performance. Choosing the wrong data structure for a specific use case can lead to unnecessary overhead. **Example:** """rust use std::collections::{HashMap, HashSet}; // Efficient lookup let mut map: HashMap<String, i32> = HashMap::new(); map.insert("key".to_string(), 42); let value = map.get("key"); // Efficient uniqueness check let mut set: HashSet<i32> = HashSet::new(); set.insert(1); let contains_one = set.contains(&1); // Less efficient lookup (if frequent insertions/deletions) let mut vec = Vec::new(); vec.push((String::from("key"), 42)); //searching involves iteration through the vector """ ### 1.2 Avoid Unnecessary Allocations **Standard:** Minimize dynamic memory allocations, especially in performance-critical sections. Prefer using stack allocation or reusing existing memory when possible. * **Do This:** Use "Cow<'a, T>" to avoid unnecessary clones of data. Use arena allocators in situations where a large number of allocations occur and are short lived. Explore the "smallvec" crate for stack-allocated vectors. * **Don't Do This:** Create many short-lived "String" or "Vec" instances. Allocate in loops if a single allocation could suffice. **Why:** Memory allocation is a relatively expensive operation. Excessive allocation can trigger garbage collection (in other languages), or cause cache misses, which both degrade performance. **Example:** """rust use std::borrow::Cow; fn process_data(input: Cow<str>) -> String { match input { Cow::Borrowed(s) => format!("Processed: {}", s), // Avoid allocation Cow::Owned(s) => format!("Processed: {}", s), // Process owned string } } fn main() { let static_str = "static data"; let owned_string = String::from("owned data"); let processed_static = process_data(Cow::Borrowed(static_str)); let processed_owned = process_data(Cow::Owned(owned_string)); println!("{}", processed_static); println!("{}", processed_owned); } """ ### 1.3 Optimize Algorithms **Standard:** Choose efficient algorithms with appropriate time complexity. * **Do This:** Use efficient sorting algorithms like mergesort or quicksort (Rust's "sort" uses a hybrid algorithm). Consider using specialized algorithms for specific tasks (e.g., string searching). * **Don't Do This:** Use brute-force approaches for problems that have efficient algorithmic solutions. Neglect the impact of algorithm choice. **Why:** Algorithmic efficiency has a significant impact on performance, especially for large datasets. **Example:** """rust // Inefficient: O(n^2) fn naive_search(haystack: &str, needle: &str) -> Option<usize> { for i in 0..=(haystack.len() - needle.len()) { if &haystack[i..(i + needle.len())] == needle { return Some(i); } } None } // More efficient: Use a string search algorithm like Knuth-Morris-Pratt or Boyer-Moore // (Rust's "find" method uses an optimized algorithm) fn optimized_search(haystack: &str, needle: &str) -> Option<usize> { haystack.find(needle) } """ ## 2. Concurrency and Parallelism ### 2.1 Use Threads Efficiently **Standard:** Use appropriate concurrency primitives for parallel execution, considering the overhead of thread creation and synchronization. Limit the number of threads to the number of logical cores. * **Do This:** Use thread pools or async/await for managing concurrent tasks. Use channels for communication between threads to eliminate the need for shared memory and locks. * **Don't Do This:** Create a large number of threads or spawn threads for short-lived tasks without pooling. Use shared mutable state without proper synchronization. **Why:** Threads introduce overhead due to context switching and synchronization. Incorrect use can lead to performance bottlenecks or data races. **Example:** """rust use std::thread; use std::sync::mpsc; fn main() { let n_workers = 4; let (tx, rx) = mpsc::channel(); for i in 0..n_workers { let tx = tx.clone(); thread::spawn(move || { // Perform tasks and send results tx.send(i * 2).unwrap(); }); } drop(tx); // Close the sender to signal completion for received in rx { println!("Received: {}", received); } } """ ### 2.2 Utilize "rayon" for Data Parallelism **Standard:** For embarrassingly parallel operations on collections, utilize the "rayon" crate for efficient parallel iteration. * **Do This:** Use "par_iter", "par_iter_mut", and "par_bridge" for parallel processing of collections. * **Don't Do This:** Manually create threads for simple data parallel operations. **Why:** "rayon" provides a high-level, efficient abstraction for data parallelism, simplifying thread management and minimizing overhead. **Example:** """rust use rayon::prelude::*; fn main() { let mut vec: Vec<i32> = (0..100).collect(); // Parallel processing to square each element vec.par_iter_mut().for_each(|x| *x *= *x); println!("{:?}", vec); } """ ### 2.3 Leverage Asynchronous Programming (async/await) **Standard:** Utilize async/await for I/O-bound tasks, non-blocking operations, and concurrent execution without the overhead of threads. Choose a suitable runtime like "tokio" or "async-std". * **Do This:** Use ".await" on futures to yield control back to the runtime, allowing other tasks to run. Use async versions of I/O operations. Use "select!" macro to concurrently await multiple futures. * **Don't Do This:** Block the main thread using synchronous operations when async equivalents are available. Use async/await for CPU-bound tasks (use threads instead for those). **Why:** Async/await allows a single thread to handle multiple concurrent I/O operations efficiently, improving responsiveness and reducing resource consumption. **Example:** """rust use tokio::net::TcpListener; use tokio::io::{AsyncReadExt, AsyncWriteExt}; #[tokio::main] async fn main() -> Result<(), Box<dyn std::error::Error>> { let listener = TcpListener::bind("127.0.0.1:8080").await?; loop { let (mut socket, _) = listener.accept().await?; tokio::spawn(async move { let mut buf = [0; 1024]; loop { let n = match socket.read(&mut buf).await { Ok(0) => return, Ok(n) => n, Err(e) => { eprintln!("failed to read from socket; err = {:?}", e); return; } }; if socket.write_all(&buf[0..n]).await.is_err() { eprintln!("failed to write to socket"); return; } } }); } } """ ## 3. Memory Management ### 3.1 Minimize Copying **Standard:** Reduce unnecessary data copying by using references, borrowing, and smart pointers. * **Do This:** Pass data by reference ("&") whenever possible. Utilize "Rc" or "Arc" for shared ownership. * **Don't Do This:** Clone data unnecessarily, especially large data structures. **Why:** Copying data consumes CPU cycles and memory bandwidth. Reducing copies improves performance. Rust's ownership system helps prevent accidental copies, but it is possible to thwart it using "clone()" liberally. **Example:** """rust // Pass by reference fn print_length(s: &String) { println!("Length: {}", s.len()); } fn main() { let my_string = String::from("hello"); print_length(&my_string); // Pass a reference } """ ### 3.2 Use Smart Pointers Wisely **Standard:** Choose the appropriate smart pointer based on ownership semantics: "Box" for unique ownership, "Rc" for single-threaded shared ownership, and "Arc" for thread-safe shared ownership. Use "Cell" and "RefCell" judiciously for interior mutability. * **Do This:** Use "Box" for dynamic allocation when ownership is exclusive. Use "Arc" for shared data across threads. Use "Weak" to prevent circular references with "Rc" and "Arc". Consider pin-project for self-referential data structures. * **Don't Do This:** Use "Rc" in multi-threaded contexts, or "Arc" in a single thread if "Rc" is sufficient. Overuse "Cell" and "RefCell", potentially negating borrow checker guarantees. **Why:** Smart pointers manage memory automatically and enforce ownership rules. Choosing the wrong pointer can lead to performance issues or memory leaks and undefined behavior when using "Rc" across threads. **Example:** """rust use std::sync::Arc; use std::thread; fn main() { let data = Arc::new(vec![1, 2, 3]); let mut handles = vec![]; for _ in 0..3 { let data = Arc::clone(&data); let handle = thread::spawn(move || { println!("Data: {:?}", data); }); handles.push(handle); } for handle in handles { handle.join().unwrap(); } } """ ### 3.3 Manage Lifetimes Effectively **Standard:** Understand and correctly use lifetimes to avoid dangling pointers and ensure memory safety. * **Do This:** Explicitly annotate lifetimes when necessary, especially in structs and functions that return references. * **Don't Do This:** Ignore lifetime errors or use "'static" lifetimes indiscriminately, to avoid memory safety problems. Misunderstanding lifetimes could lead to copies instead of borrows. **Why:** Correct lifetime management is crucial for memory safety and prevents undefined behavior. **Example:** """rust //Explicit lifetimes fn get_first<'a>(s: &'a str) -> &'a str { s.split(' ').next().unwrap_or("") } fn main() { let sentence = String::from("This is a sentence."); let first_word = get_first(&sentence); println!("First word: {}", first_word); } """ ## 4. Compiler Optimizations and Profiling ### 4.1 Enable Optimizations **Standard:** Ensure that optimizations are enabled in release builds. * **Do This:** Build with "cargo build --release". Configure "Cargo.toml" to enable specific optimizations. Use Link-Time Optimization (LTO) for improved cross-crate optimization. * **Don't Do This:** Deploy debug builds to production. Neglect to configure compiler optimization levels. **Why:** Compiler optimizations significantly improve performance. Release builds perform inlining, loop unrolling, and other optimizations to generate faster code. **Example:** """toml # Cargo.toml [profile.release] opt-level = 3 # Maximum optimization lto = "thin" # Link-Time Optimization codegen-units = 1 # Reduce code size panic = "abort" # Reduce binary size """ ### 4.2 Profile Your Code **Standard:** Use profiling tools to identify performance bottlenecks and optimize the most critical sections of code. * **Do This:** Use tools like "perf", "valgrind" (with callgrind), "cargo-flamegraph", and Rust's built-in benchmarking tools. * **Don't Do This:** Guess at performance bottlenecks. Optimize code without measuring its impact. **Why:** Profiling provides insights into where your application spends its time, allowing you to focus optimization efforts effectively. **Example:** """bash cargo install flamegraph cargo flamegraph --bench bench_name """ """rust #[bench] fn bench_function(b: &mut Bencher) { b.iter(|| { // Code to be benchmarked comes here }); } """ ### 4.3 Use Inline Attributes Judiciously **Standard:** Use "#[inline]" to suggest inlining small, frequently called functions. * **Do This:** Add "#[inline]" to small functions that are called in performance-critical sections. Consider "#[inline(always)]" for functions that *must* be inlined. * **Don't Do This:** Overuse "#[inline]". Don't use it for large or infrequently called functions, as it can increase code size and potentially reduce performance. **Why:** Inlining reduces function call overhead, but excessive inlining increases code size, leading to increased memory usage and potentially reduced I-cache performance. **Example:** """rust #[inline] fn add(x: i32, y: i32) -> i32 { x + y } """ ### 4.4 Leverage SIMD (Single Instruction, Multiple Data) **Standard:** Exploit SIMD instructions for parallel processing of data within vectors/arrays. * **Do This:** Explore crates like "packed_simd" or architecture-specific intrinsics to leverage SIMD instructions. * **Don't Do This:** Manually implement SIMD operations without using appropriate libraries or intrinsics. **Why:** SIMD instructions allow performing the same operation on multiple data points simultaneously, significantly speeding up data-intensive operations. **Example:** """rust //Needs nightly #![feature(portable_simd)] // Enable portable_simd feature use std::simd::{SimdI32, SimdFloat}; fn add_arrays_simd(a: &[i32], b: &[i32], result: &mut [i32]) { let simd_width = SimdI32::LANES; for i in (0..a.len()).step_by(simd_width) { let a_simd = SimdI32::from_slice(&a[i..]); let b_simd = SimdI32::from_slice(&b[i..]); let result_simd = a_simd + b_simd; result_simd.copy_to_slice(&mut result[i..]); } } """ ## 5. System-Level Considerations ### 5.1 Reduce System Calls **Standard:** Minimize the number of system calls, as they involve transitioning between user and kernel space, incurring overhead. * **Do This:** Batch operations to reduce the number of calls. Use buffered I/O. * **Don't Do This:** Make frequent small system calls, especially in loops. **Why:** System calls have higher overhead compared to user-space operations. **Example:** """rust use std::fs::File; use std::io::{BufWriter, Write}; fn write_data(data: &[u8], filename: &str) -> std::io::Result<()> { let file = File::create(filename)?; let mut writer = BufWriter::new(file); writer.write_all(data)?; writer.flush()?; // Ensure all data is written to disk Ok(()) } """ ### 5.2 Be Aware of Cache Locality **Standard:** Structure data and access it in a way that maximizes cache hits. * **Do This:** Access elements in contiguous memory locations sequentially. Use Struct of Arrays (SoA) layout when appropriate. * **Don't Do This:** Jump randomly around memory when accessing data. **Why:** Cache memory is faster than main memory. Data arranged to improve locality will result in fewer cache misses, and faster execution. **Example:** """rust // Structure of Arrays (SoA) layout struct ParticleSystem { x: Vec<f32>, y: Vec<f32>, z: Vec<f32>, } // Accessing in SoA is often more cache efficient than Array of Structures (AoS) for calculations. """ ### 5.3 Choose Efficient Serialization Formats **Standard:** Select serialization formats that balance speed, space efficiency, and compatibility. * **Do This:** Consider binary formats like Protocol Buffers, FlatBuffers, or bincode for performance-critical applications. Use text-based formats like JSON or YAML when human readability or interoperability is more important. * **Don't Do This:** Default to inefficient text-based formats without considering alternatives. **Why:** Serialization and deserialization can be performance bottlenecks, especially when dealing with large datasets. Choosing efficient formats significantly improves performance. **Example:** """rust use serde::{Deserialize, Serialize}; use bincode; #[derive(Serialize, Deserialize, PartialEq, Debug)] struct Data { a: u32, b: String, } fn main() { let data = Data { a: 42, b: "hello".to_string() }; // Serialize with bincode let encoded: Vec<u8> = bincode::serialize(&data).unwrap(); // Deserialize with bincode let decoded: Data = bincode::deserialize(&encoded[..]).unwrap(); assert_eq!(data, decoded); } """ By adhering to these coding standards, Rust developers can build high-performance applications that utilize resources efficiently, respond quickly, and maintain excellent performance characteristics. Remember to profile your code and measure the impact of optimizations to ensure effectiveness. Continuously revisit these guidelines as the Rust ecosystem evolves.

DA

danielsoglCreated Mar 6, 2025

Cline

Overview

Key Concepts

Purpose of .clinerules

File Location

Rule Structure

1. Project Overview

2. Code Standards

3. Security Rules

Best Practices

Writing Effective Rules

Common Patterns

Integration with Development Workflow

Using with Version Control

Troubleshooting

Common Issues

Examples

Basic Project Setup

Advanced Configuration

Related Rules

Security Best Practices Standards for Rust

Core Architecture Standards for Rust

Component Design Standards for Rust

State Management Standards for Rust

Performance Optimization Standards for Rust