# How to Find the Missing Number in an Array In an array containing numbers from 1 to N, where one number is missing, the goal is to find the missing number. Here, we'll discuss four different methods to achieve this, ranging from brute force to optimal approaches. ### Solution 1: Brute Force Approach (Using Nested Loop & Linear Search) This approach involves checking for each number from 1 to N whether it exists in the array. **Implementation**: ```cpp // Solution-1: Brute Force Approach (using Nested Loop & Linear Search) // Time Complexity: O(n*n) // Space Complexity: O(1) int findMissingNumber(vector &arr, int n) { // Outer loop that runs from 1 to N for (int i = 1; i <= n; i++) { // `flag` variable to check if an element exists bool flag = false; // Search the element using Linear Search for (int j = 0; j < n - 1; j++) { if (arr[j] == i) { flag = true; break; } } // Check if the element is missing if (!flag) { return i; } } // If loop completes without finding a missing number (shouldn't happen) // It is just to avoid warnings. return -1; } ``` **Logic**: 1. **Outer Loop**: Iterate through numbers from 1 to N. 2. **Linear Search**: For each number, check if it exists in the array using a nested loop. 3. **Flag Check**: Use a flag to indicate if the number was found. If not, return the number as it is the missing one. **Time Complexity**: O(n²) * **Explanation**: The outer loop runs from 1 to N and for each iteration of the outer loop, the inner loop runs up to N-1. This results in a time complexity of O(n \* (n-1)), which simplifies to O(n²). **Space Complexity**: O(1) * **Explanation**: Only a few additional variables (like `flag`) are used, which do not depend on the size of the input array. **Example**: * **Input**: `arr = [1, 2, 4, 6, 3, 7, 8]`, `n = 8` * **Output**: `5` * **Explanation**: The number `5` is missing from the array. --- ### Solution 2: Better Approach (Using Hashing) This approach uses a hash table to track the presence of numbers in the array. **Implementation**: ```cpp // Solution-2: Better Approach (using Hashing) // Time Complexity: O(n) + O(n) ~ O(2n) // Space Complexity: O(n) int findMissingNumber(vector &arr, int n) { // Hash table to store the presence of numbers vector hash(n + 1, 0); // Mark the presence of elements in the hash table for (int i = 0; i < n - 1; i++) { hash[arr[i]] = 1; } // Find the missing number by iterating through the hash table for (int i = 1; i <= n; i++) { if (hash[i] == 0) { return i; } } // If loop completes without finding a missing number (shouldn't happen) // It is just to avoid warnings. return -1; } ``` **Logic**: 1. **Initialize Hash Table**: Create a hash table to track numbers from 1 to N. 2. **Mark Presence**: Iterate through the array, marking the presence of each number in the hash table. 3. **Find Missing Number**: Iterate through the hash table to find the number that is not marked. **Time Complexity**: O(n) * **Explanation**: The first loop runs through the array of size N-1 to populate the hash table and the second loop runs through the hash table of size N to find the missing number. Thus, the overall time complexity is O(n) + O(n-1), which simplifies to O(n). **Space Complexity**: O(n) * **Explanation**: An additional hash table of size N is used to keep track of the presence of each number. **Example**: * **Input**: `arr = [1, 2, 4, 6, 3, 7, 8]`, `n = 8` * **Output**: `5` * **Explanation**: The number `5` is not present in the hash table. --- ### Solution 3: Optimal Approach 1 (Summation Approach) This approach uses the sum formula for the first N natural numbers to find the missing number. **Implementation**: ```cpp // Solution-3: Optimal Approach 1 (Summation Approach) // Time Complexity: O(n) // Space Complexity: O(1) int findMissingNumber(vector &arr, int n) { // Sum of all numbers from 1 to N int sumOfNNums = (n * (n + 1)) / 2; int sumOfArr = 0; // Calculate the actual sum of elements in the array for (int i = 0; i < n - 1; i++) { sumOfArr += arr[i]; } int missingNum = sumOfNNums - sumOfArr; return missingNum; } ``` **Logic**: 1. **Sum of N Numbers**: Calculate the sum of numbers from 1 to N using the formula sum = (n \* (n + 1)) / 2. 2. **Sum of Array**: Calculate the sum of elements in the array. 3. **Find Missing Number**: Subtract the sum of the array from the sum of N numbers to get the missing number. **Time Complexity**: O(n) * **Explanation**: The loop runs through the array once to calculate the sum of the elements, resulting in a time complexity of O(n). **Space Complexity**: O(1) * **Explanation**: Only a few additional variables (like `sumOfNNums` and `sumOfArr`) are used, which do not depend on the size of the input array. **Example**: * **Input**: `arr = [1, 2, 4, 6, 3, 7, 8]`, `n = 8` * **Output**: `5` * **Explanation**: The sum of numbers from `1` to `8` is `36` and the sum of the array is `31.` Thus, the missing number is `36 - 31 = 5`. --- ### Solution 4: Optimal Approach 2 (XOR Approach) This approach uses the XOR operation to find the missing number efficiently. **Implementation**: ```cpp // Solution-4: Optimal Approach 2 (XOR Approach) // Time Complexity: O(n) // Space Complexity: O(1) int findMissingNumber(vector &arr, int n) { int xor1 = 0; int xor2 = 0; for (int i = 0; i < n - 1; i++) { // XOR of array elements xor2 ^= arr[i]; // XOR up to [1...N-1] xor1 ^= (i + 1); } // XOR up to [1...N] xor1 ^= n; // The missing number return xor1 ^ xor2; } ``` **Logic**: 1. **XOR Array Elements**: Compute the XOR of all elements in the array. 2. **XOR Up to N**: Compute the XOR of all numbers from 1 to N. 3. **Find Missing Number**: XOR the results from steps 1 and 2. The result will be the missing number. **Time Complexity**: O(n) * **Explanation**: The first loop runs through the array of size N-1 to calculate the XOR of the elements and the second loop runs up to N to calculate the XOR of the range from 1 to N. Thus, the overall time complexity is O(n-1) + O(n), which simplifies to O(n). **Space Complexity**: O(1) * **Explanation**: Only a few additional variables (like `xor1` and `xor2`) are used, which do not depend on the size of the input array. **Example**: * **Input**: `arr = [1, 2, 4, 6, 3, 7, 8]`, `n = 8` * **Output**: `5` * **Explanation**: XOR of all array elements and XOR of numbers from `1` to `8` will result in the missing number, which is `5`. --- ### Comparison * **Brute Force Approach**: * **Pros**: Simple and easy to understand. * **Cons**: Inefficient with O(n²) time complexity. * **Hashing Approach**: * **Pros**: More efficient with O(n) time complexity. * **Cons**: Uses extra space for the hash table. * **Summation Approach**: * **Pros**: Very efficient with O(n) time complexity and O(1) space complexity. * **Cons**: Might face integer overflow for very large values of N. * **XOR Approach**: * **Pros**: Efficient with O(n) time complexity and O(1) space complexity. * **Cons**: Slightly more complex to understand compared to summation approach. ### Edge Cases * **Empty Array**: If the input array is empty, the missing number should be `1`. * **Single Element Missing**: When the array contains only one element, either `1` (missing `2`) or `2` (missing `1`), handle by checking and returning the missing number. * **All Elements Present**: If the array contains all elements from 1 to N without any missing number (should not happen if constraints guarantee a missing number), return a sentinel value like `-1`. * **Duplicated Elements**: This scenario is not considered because the problem constraints specify that all elements are distinct. ### Additional Notes * **Integer Overflow**: For the summation approach, consider the risk of integer overflow when calculating the sum of the first N natural numbers. In practical implementations, ensure the language or environment can handle large integer values. * **Data Types**: Choose appropriate data types to handle the range of input values, especially for large N. * **Performance Considerations**: While the brute force approach is easy to implement, it is not suitable for large arrays due to its O(n²) time complexity. The hashing and XOR approaches are preferable for large datasets. * **In-place Modifications**: For space efficiency, the XOR approach modifies the input array in place. Ensure that this behavior is acceptable in the context of the problem or use case. * **Understanding XOR**: The XOR approach leverages the property that `a ^ a = 0` and `a ^ 0 = a`. It effectively cancels out duplicate elements and isolates the missing number. This method is both efficient and elegant but requires understanding of bitwise operations. ### Conclusion Finding the missing number in an array of size N-1 can be achieved using various methods, from simple brute force to optimal XOR-based solutions. Depending on the constraints and requirements, one can choose the most suitable approach. ---