3.1. General idea and framework
Federated learning allows data nodes to perform multiple rounds of local model training and then upload only the resulting local models to a central node for parameter aggregation, thus avoiding the transmission of raw data across nodes and protecting data privacy to a certain extent. The core idea of the FedAvg algorithm is to transmit intermediate information, such as model parameters, between nodes in place of the original data [25]. However, this intermediate information is often a "refinement" of the knowledge contained in the original data, so a risk of privacy leakage remains when it is exposed to adversaries. In this paper, privacy leakage is divided into two categories: (i) privacy leakage caused by local information exposure, and (ii) privacy leakage caused by global information exposure.
This paper proposes the following ideas to resist the two types of privacy leakage risks. (i) An adversary may reconstruct a client's local dataset from the data the client uploads in each round. We therefore use secure multi-party computation to hide the data uploaded by each client while still allowing the server to combine the uploads into the correct aggregation result, thereby preventing the leakage of local information. (ii) Since the amount of information in data can only decrease under computation or processing, once the adversary can no longer steal the individual data uploaded by a user, the information closest to the original data that it can obtain is the aggregated model of each round. Based on the idea of local differential privacy, each client therefore adds a calibrated perturbation to the model obtained from local training and uploads the perturbed model to the server, so that each round of aggregation satisfies differential privacy: whether or not any single sample of a client participates in training, the distribution of the aggregated global model does not change significantly [26]. This prevents the aggregated model from being exploited by adversaries.
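To make idea (ii) concrete, the sketch below shows one standard way to realize such a perturbation: clipping the local update to the L2 bound \(C\) and adding Gaussian noise. This is a minimal illustration, not necessarily the paper's exact mechanism; the helper name `perturb_update` and the noise multiplier `sigma` (which would be calibrated from the target \((\epsilon ,\delta )\)) are assumptions.

```python
import numpy as np

rng = np.random.default_rng()

def perturb_update(update, C, sigma):
    """Clip a flattened model update to L2 norm C, then add Gaussian noise.

    `sigma` is an assumed noise multiplier; deriving it from the target
    (epsilon, delta) budget is omitted here.
    """
    norm = np.linalg.norm(update)
    clipped = update * min(1.0, C / max(norm, 1e-12))  # bound the sensitivity
    return clipped + rng.normal(0.0, sigma * C, size=update.shape)
```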
The overall framework of the proposed model is shown in Fig. 1. The participating nodes are (i) \(n\) clients \({C}_{1},{C}_{2},\cdots ,{C}_{n}\), each responsible for locally storing its private dataset, and (ii) \(m\) servers \({S}_{1},{S}_{2},\cdots ,{S}_{m}\) with \(m\ge 2\), responsible for the aggregate computation over data shares. A secure channel exists between each client and each server. Table 1 lists the notations and descriptions used in this paper.
Table 1. Notations and descriptions used in this paper.

| Symbol | Description |
| --- | --- |
| \({S}_{i}\) | \(i\)th server node |
| \({C}_{i}\) | \(i\)th client node |
| \({D}_{i}\) | Local dataset of \({C}_{i}\) |
| \(\lvert {D}_{i}\rvert\) | Number of samples contained in \({D}_{i}\) |
| \(N\) | Minimum of \(\lvert {D}_{i}\rvert\) over all clients |
| \({M}^{r}\) | Global model in the \(r\)th round |
| \({M}_{i}^{r}\) | Local model of client \({C}_{i}\) in the \(r\)th round |
| \({M}_{i,j}^{r}\) | Model share uploaded by \({C}_{i}\) to \({S}_{j}\) in the \(r\)th round |
| \({M}_{\ast ,j}^{r}\) | Aggregate share held by server \({S}_{j}\) in the \(r\)th round |
| \(R\) | Total number of training rounds |
| \(C\) | Upper bound on the L2 norm used for clipping |
| \(K\) | Lower bound on the number of clients uploading per round |
| \(B\) | Mini-batch size |
| \(E\) | Number of passes each client makes over its local dataset (local epochs) |
3.2. Threat model
The system involves three types of roles: clients, servers, and external adversaries. This paper mainly considers the first two, internal adversaries, since they directly participate in the training process and are therefore more threatening.
Server. The server is assumed to be semi-honest: it correctly executes the algorithms and protocols but tries to infer additional private information from the data it collects. It is further assumed that the number of colluding servers is less than the threshold \(t\) of the secret sharing scheme; taking the \((m,m)\)-threshold scheme as an example, this amounts to assuming at least one honest server.
Client. The client is likewise assumed to be semi-honest. An adversarial client's goal is to obtain information about honest clients' training data by observing the interaction, rather than to upload maliciously tampered data that would reduce model accuracy or even prevent training from converging. The number of colluding clients is assumed to be less than \(n-1\) (i.e., at most \(n-2\)); otherwise, for a reversible aggregation function \(F\left({d}_{1},\cdots ,{d}_{n}\right)\), the colluding nodes could infer the input of the only honest node from the output and their \(n-1\) known inputs.
External adversary. After training, the model is deployed to a node or cloud to provide prediction services. The adversary can analyze outputs obtained through limited access to the model interface and attempt to infer a client's local data. Given its limited knowledge and capability, an external adversary cannot obtain intermediate information from the training process, so its attack success rate is generally lower than that of the two types of internal adversaries above.
3.3. Training process
The algorithm involves the following parameters: a set of servers \(S=\left\{{S}_{1},{S}_{2},\cdots ,{S}_{m}\right\}\) with \(m\ge 2\); a set of clients \(C=\left\{{C}_{1},{C}_{2},\cdots ,{C}_{n}\right\}\) with \(n\ge 3\), where each client \({C}_{i}\) holds a local dataset from \(D=\left\{{D}_{1},{D}_{2},\cdots ,{D}_{n}\right\}\); a minimum number \(K\) of clients that upload parameters in each round; a machine learning algorithm \(L\) executed consistently by all clients during local training (in this study, the model \(M\) is a neural network trained by gradient descent, with the architecture declared before training begins); differential privacy parameters \(\epsilon\) and \(\delta\), where smaller values correspond to stronger privacy protection; the maximum number \(t'\) of colluding servers, which the threshold of the secret sharing scheme must exceed; and the total number of training rounds \(R\).
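For reference, these hyperparameters could be gathered into a single configuration object. The following sketch mirrors the notation above; the class and field names are illustrative assumptions, not part of the paper's algorithm.

```python
from dataclasses import dataclass

@dataclass
class FLConfig:
    m: int            # number of servers (m >= 2)
    n: int            # number of clients (n >= 3)
    K: int            # minimum number of clients uploading per round
    R: int            # total number of training rounds
    B: int            # mini-batch size
    E: int            # local epochs over each client's dataset
    C: float          # L2-norm clipping bound
    epsilon: float    # differential privacy budget
    delta: float      # differential privacy failure probability
    t_prime: int      # max colluding servers (< secret sharing threshold)
```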
The procedure for training a privacy-preserving federated learning model, referred to as Algorithm 1, is as follows. Initially, the server \({S}_{1}\) initializes the model parameters. Each client then downloads the model and trains it on its local dataset, obtaining new model parameters. A sequence of operations is performed on the local model to control the sensitivity of the aggregated parameters: the local model is first clipped, and calibrated noise is then added. The resulting model is secret-shared among all the servers. Model parameters are aggregated with the FedAvg weighted average; to keep the presentation simple, each client is assumed to use a dataset of equal size for local training. Once a sufficient number of parameter shares has been received, each server locally averages its shares to obtain an aggregate share. Each client then downloads the aggregate shares from all servers, reconstructs the secret, and obtains the updated model parameters. These steps are repeated until the training objective is achieved.
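A minimal end-to-end sketch of one such round is given below. For readability it uses real-valued additive masking rather than the fixed-point ring the paper actually works over (described next), and it omits clipping and noise addition; the function names `share` and `one_round` are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(42)

def share(vec, m):
    """Split vec into m additive shares: m - 1 random masks plus a correction."""
    masks = [rng.normal(size=vec.shape) for _ in range(m - 1)]
    return masks + [vec - sum(masks)]

def one_round(local_models, m, K):
    """Toy version of one aggregation round: share, average locally, recover."""
    # Client side: each client secret-shares its (already clipped and
    # perturbed) local model and sends the j-th share to server S_j.
    inbox = [[] for _ in range(m)]
    for model in local_models:
        for j, s in enumerate(share(model, m)):
            inbox[j].append(s)
    # Server side: S_j averages the K shares it holds; no interaction needed.
    agg_shares = [sum(box) / K for box in inbox]
    # Download side: summing the m aggregate shares reconstructs the average.
    return sum(agg_shares)

models = [rng.normal(size=4) for _ in range(5)]            # K = 5 local models
print(np.allclose(one_round(models, m=3, K=5), np.mean(models, axis=0)))  # True
```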
Secret sharing and secure computation protocols are typically designed over algebraic structures such as finite fields or commutative rings, which are not directly applicable to real-valued data. It is therefore necessary to encode the data appropriately and establish a mapping to such a structure. In our approach, the model parameters are mapped onto the ring \({Z}_{{2}^{l}}\), representing each real parameter as a fixed-point number of \(l\) bits, of which the lower \(e\) bits are allocated to the fractional part. For example, a floating-point parameter \(x\) of the local model \({M}_{i}^{r}\) in Step 14 is encoded as \({x}'=\text{int}\left(x\cdot {2}^{e}\right)\), where int denotes rounding to the nearest integer. In this study, we set \(l=64\) and \(e=32\), so the encoded data can be stored in the int64 data type. The encoded fixed-point numbers can represent values in the range \(\left[-{2}^{l-e-1}+{2}^{-e},\;{2}^{l-e-1}-{2}^{-e}\right]\). Conversely, given an encoded value \({x}'\), decoding is the simple computation \(x={x}'/{2}^{e}\), which recovers the original parameter.
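The following sketch shows this encoding with \(l=64\) and \(e=32\), as in the paper; the helper names `encode` and `decode` are illustrative.

```python
import numpy as np

L_BITS, E_BITS = 64, 32        # l = 64 total bits, e = 32 fractional bits
SCALE = 2 ** E_BITS

def encode(x):
    """Map floats to fixed-point elements of Z_{2^l}, stored as int64.

    Valid only for |x| < 2**(L_BITS - E_BITS - 1), the range given above.
    """
    return np.rint(np.asarray(x) * SCALE).astype(np.int64)

def decode(x_enc):
    """Invert the encoding: x = x' / 2^e."""
    return np.asarray(x_enc, dtype=np.float64) / SCALE

w = np.array([0.125, -3.5, 0.3])
print(decode(encode(w)))       # [ 0.125  -3.5  0.3000000000465661 ]
```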
Consider two numbers \(x\) and \(y\) shared among \(n\) nodes as \(\left[x\right]=\left\{{x}_{1},{x}_{2},\cdots ,{x}_{n}\right\}\) and \(\left[y\right]=\left\{{y}_{1},{y}_{2},\cdots ,{y}_{n}\right\}\), where node \(i\) holds \({x}_{i}\) and \({y}_{i}\). To compute a share of \(x+y\), each node independently computes \({x}_{i}+{y}_{i}\); accordingly, in Step 19 of Algorithm 1, \({S}_{j}\) adds its local shares to obtain a share of the sum of the local models. Constant multiplication is likewise a local operation: given a share \(\left[x\right]\) and a public constant \(c\), each node computes \(c\cdot {x}_{i}\) to obtain the share \(\left[cx\right]\), since secret recovery yields \({\sum }_{i=1}^{n}c\cdot {x}_{i}=c{\sum }_{i=1}^{n}{x}_{i}=cx\). In particular, because the parameter \(K\) in Step 19 is a constant known to the servers, the average share can also be computed locally.
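The demo below checks both local operations on additive shares over the int64 ring (where overflow deliberately wraps modulo \({2}^{64}\)); the helper names are assumptions. Note that exact division by \(K\) over the ring would additionally require fixed-point truncation, which is omitted here.

```python
import numpy as np

rng = np.random.default_rng(1)
I64 = np.iinfo(np.int64)

def share(x, n):
    """Split an int64 vector x into n additive shares over Z_{2^64}."""
    parts = [rng.integers(I64.min, I64.max, size=x.shape, dtype=np.int64)
             for _ in range(n - 1)]
    return parts + [x - sum(parts)]     # int64 arithmetic wraps mod 2^64

def recover(shares):
    return sum(shares)                  # summation undoes the random masking

x = np.array([7, -2], dtype=np.int64) << 32    # fixed-point encoded values
y = np.array([3,  9], dtype=np.int64) << 32
c = np.int64(5)

sx, sy = share(x, 3), share(y, 3)
# Addition: each node adds its own shares of x and y, purely locally.
print(np.array_equal(recover([a + b for a, b in zip(sx, sy)]), x + y))  # True
# Constant multiplication: each node scales its share by the public c.
print(np.array_equal(recover([c * s for s in sx]), c * x))              # True
```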
Algorithm 1 is also resilient to client disconnection. Since the data nodes participating in training are often unstable mobile edge devices, a privacy protection scheme must keep the training process effective even when nodes go offline for some period. In Algorithm 1, if a client disconnects, its share of the model update is simply never delivered to the servers, and the client is treated as not participating in the current round. Note that the algorithm assumes each client transmits its shares to all servers simultaneously after executing the SecShr algorithm, so no client can selectively send shares to only some servers. If such a situation does occur, the servers can resolve it efficiently with one additional round of communication: they confirm the source client IDs of all received shares and intersect these sets before performing the aggregation.
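A sketch of that reconciliation step might look as follows; the helper name and the data layout are assumptions.

```python
def common_client_ids(received_ids_per_server):
    """Each server reports the client IDs it received shares from; only the
    intersection is aggregated, so every server uses the same client set."""
    return set.intersection(*(set(ids) for ids in received_ids_per_server))

# e.g., one client's share to the second server was lost in transit:
print(common_client_ids([{1, 2, 3}, {1, 3}, {1, 2, 3}]))   # {1, 3}
```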
Additionally, Algorithm 1 is compatible with more intricate custom aggregation functions. FedAvg uses a weighted average as the aggregation operation, which is insufficient for more demanding applications; for instance, to resist Byzantine attacks, some researchers have proposed taking the median of all client update values as the aggregation result. Unlike privacy-preserving schemes based on homomorphic encryption or functional encryption, which support only linear aggregation operations, the proposed algorithm can compute an arbitrary aggregation function \(g({x}_{1},{x}_{2},\cdots ,{x}_{n})\). This is achieved by redefining Step 19 of Algorithm 1 as \({M}_{\ast ,j}^{r}\leftarrow \text{SecComp}\left(g\left({M}_{{i}_{1},j}^{r},\cdots ,{M}_{{i}_{K},j}^{r}\right)\right)\), where the secure multi-party computation protocol \(\text{SecComp}\) may require additional communication among the servers; both the communication volume and the number of communication rounds of \(\text{SecComp}\) depend on the specific aggregation function \(g\).
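For concreteness, the plaintext analogue of one such robust rule, the coordinate-wise median, is sketched below. Evaluating it obliviously on shares is exactly what a \(\text{SecComp}\) protocol would have to do, since the median, unlike the average, is not a local computation on additive shares; the function name `g_median` is an assumption.

```python
import numpy as np

def g_median(updates):
    """Coordinate-wise median of client updates (plaintext analogue of g).

    Byzantine-robust: a minority of arbitrary per-coordinate values cannot
    drag the result to an outlier, unlike the mean.
    """
    return np.median(np.stack(updates), axis=0)

updates = [np.array([1.0, 2.0]), np.array([1.1, 2.2]), np.array([9.9, -50.0])]
print(g_median(updates))   # [1.1 2. ] -- the outlier has bounded influence
```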
Lastly, existing federated learning frameworks often rely on a trusted center. In this paper, the use of secure multi-party computation for parameter aggregation extends naturally to decentralized scenarios, in which every party acts as both a data node and a computation node, performing local model training while also taking responsibility for secure parameter aggregation. Specifically, the parties are \(\left\{{C}_{1},{C}_{2},\cdots ,{C}_{n}\right\}=\left\{{S}_{1},{S}_{2},\cdots ,{S}_{m}\right\}\) with \(m=n\). Adopting an \((n,n)\)-threshold secret sharing scheme ensures that no party needs to trust the other participants and that no party's parameters can be reconstructed.