Merging Feature Classes in ArcGIS Pro

This article will walk you through the steps to merge two soil surveys into one comprehensive soil map.

We used soil data produced by the National Cooperative Soil Survey (USDA Natural Resources Conservation Service (NRCS): https://websoilsurvey.sc.egov.usda.gov/app/HomePage.htm

Software requirement(s):

ArcGIS Pro 3.3.0

Download package here:

https://tribalgis-my.sharepoint.com/:u:/p/sarques/EVJ-yy7gTqtFt1jHoVXgTEwBITjpf6cP7UJvWyAuRaswvg?e=lXqArc

Merging feature classes involves combining two or more datasets into a single feature class. Here are some great examples:

  • Merging land use data from different years to analyze changes over time.
  • Integrating different administrative boundary shapefiles (e.g., city, county, state boundaries) into a single dataset.
  • Combining river networks, lakes, and watershed boundaries into a unified hydrological dataset.
  • Merging vegetation or habitat maps from different surveys or studies.

Why Merge Feature Classes?

1. Consolidation: To combine multiple datasets into a single dataset for easier management and analysis.

2. Data Integration: To integrate data from different sources or different survey periods.

3. Simplification: To reduce the number of feature classes in a geodatabase, simplifying the data structure.

4. Spatial Analysis: To create a unified dataset for spatial analysis, ensuring that all relevant features are considered together.

5. Consistency: To standardize attributes and schema across different datasets.

How to Merge Feature Classes in ArcGIS Pro

1. Prepare Your Data

Ensure that the feature classes you want to merge have similar schema (fields and data types). If necessary, use the Add Field tool to add missing fields or the Field Calculator to standardize field values.

2. Open the project

Download and double click on the Merging_Feature_Classes.ppkx to open it. Save your project in a meaningful location.

The project includes two soil maps and two standalone tables that cover a small area located between Leech Lake and Lake Winnibigoshish in the Chippewa National Forest (MN).

3. Joining Soil Data with Soil maps

In the Soil Map Units 1 & 2 Attribute Tables, you can see a Map Unit Symbol field (MUSYM). It is a shortened string for the Map Unit Name containing alphanumeric identifiers, which usually correspond to soil types/names.

Currently, we only have the MUSYM identifiers without the associated soil names. However, the standalone tables do include the MUSYM (Field 1), and the map unit names (Field 2).

The first step is to join each table with its corresponding soil map unit. We'll use the MUSYM field from the two soil map units and join the tables using Field 1, which contains the MUSYM.

In the Contents pane, right click on SoilMapUnit1, select Joins and Relates, and click on Add Join,

The Add Join tool allows you to join a table to a feature layer or another table based on a common field. Here are the options you need to configure:

Input Table

This is the feature layer or table that you want to enhance with additional attributes from another table.

Input Field

This is the field in the input table that you will use to join with the join table.

Join Table

This is the table that contains the additional attributes you want to bring into the input table.

Join Field

This is the field in the join table that matches the input field in the input table.

Keep All Input Records

This option determines whether to keep all records in the input table, regardless of whether they have matching records in the join table.

Index Join Fields

This option creates an index on the join fields to improve performance, especially with large datasets. Check this if you are working with large datasets or if you will perform multiple joins on the same fields.

Join Operation

Specifies how to handle records that do not match (e.g., "KEEP_COMMON" to keep only matching records). Choose based on whether you want to keep all records or only matching records (see yellow callout for more information below).

Validate Join

Validates the join operation to ensure that it can be completed successfully. Enable this to check for potential issues before performing the join.

Join Operation

Join one to first

This option joins each record in the input table to the first matching record in the join table. If there are multiple matches, only the first one found is used. Use this option when you want a simple join and are only interested in the first match found for each record in the input table.

Example: If you have a soil map unit and there are multiple soil types listed in the join table for a single MUSYM, this option will only join the first matching soil type to each MUSYM in the input table.

Join one to many

This option joins each record in the input table to all matching records in the join table. If there are multiple matches, all of them are included in the result. Use this option when you need to preserve all matches between the input table and the join table. This is useful when the relationship between the tables is truly one-to-many.

Example: If your soil map unit has a MUSYM that corresponds to multiple soil types in the join table, this option will join all those soil types to each MUSYM in the input table.

Which Option to Choose for Soil Data

The choice depends on the nature of your datasets and the specific requirements of your analysis:

  • Join one to first: If each MUSYM in your soil map unit 1 is expected to correspond to a single soil type in the join table (or if you only care about the first match), use this option. It simplifies the data by ensuring each MUSYM has a single corresponding soil type.
  • Join one to many: If each MUSYM can correspond to multiple soil types and you need to capture all possible matches, use this option. This will provide a more comprehensive join, including all soil types associated with each MUSYM.
  • Click OK to run the tool. Right-click on the SoilMapUnit1 to open the Attribute Table and see the changes.

  • Repeat the same steps for the SoilMapUnit2 and the mapunit2 standalone table.

Now we can merge our feature classes!

4. Access the Merge Tool

  • Go to the Analysis tab, click on Tools to open the Geoprocessing pane.
  • In the Geoprocessing pane, search for the Merge tool.
  • In the Merge tool dialog, add the input feature classes you want to merge: SoilMapUnit 1 & SoilMapUnit 2 (You can also select the relevant feature classes from your geodatabase).
  • Specify the Output Dataset where the merged feature class will be saved. Choose a name and location for the output feature class: SoilMapUnits_Merge
  • If necessary, configure the Field Map to ensure that fields are mapped correctly between the input feature classes. I'll remove the Fields that have Null values (hover on the field to display the Remove option): Fields 3 - 4, 6 - 11 and 13 - 22.

Add Source Information to Output Option

The option to Add source information to output allows you to include additional fields in the output feature class that indicate the origin of each merged record.

These fields typically include:

  • Source_FID: The ObjectID (or unique identifier) from the original feature class.
  • Source_NAME: The name of the original feature class from which the record came.

Purpose and Benefits

  • Adding source information helps trace back which original dataset each feature in the merged output came from. This is particularly useful for quality control, debugging, and understanding the lineage of your data.
  • It aids in managing and documenting the merged data, especially if you are combining datasets from multiple sources with potentially overlapping or conflicting information.
  • It allows for differentiated analysis or visualization based on the source. For example, you can color-code or filter features based on their original source.

When to Use It?

  • Use this option if you need to document the source of each feature for transparency or reporting purposes.
  • If you are concerned about the quality or consistency of data from different sources, having source information can help you identify and address issues.
  • If you plan to analyze the merged data based on its original sources, adding source information will facilitate such analyses.

When You Might Not Need It?

  • If the datasets you are merging are very similar and you do not need to differentiate between them later, you might not need this additional information.
  • If you prefer a simpler output without extra fields, especially if the source information is not crucial for your current project.
  • Click Run to execute the merge process.

5. Verify the Output

  • After the merge process completes, verify the output feature class to ensure that all features have been correctly combined.
  • Check the attribute table to confirm that the data from the input feature classes has been properly integrated.

The merge process was successful, but we still see the line separating our two map units. To create a better merge, we are going to use the Remove Overlap tool.

Alternatively, if you want to remove the line without using the Remove Overlap tool, you can try the following methods:

Dissolve Tool

The Dissolve tool can be used to combine adjacent polygons that share a common attribute into a single polygon, effectively removing the boundary lines between them.

1. Open the Dissolve tool (Analysis > Tools > Data Management > Generalization > Dissolve).

2. Select your merged feature class as the input.

3. Choose an appropriate field (or no field) for the Dissolve Fields parameter if you want to dissolve all boundaries.

4. Run the tool to create a new feature class with dissolved boundaries.

Editing Tools

Manual editing using the editing tools allows you to merge polygons by removing the boundary line.

1. Start an editing session.

2. Use the Edit Vertices tool to manually adjust the vertices of the polygons to remove the line.

3. Use the Union tool under the Modify Features pane to combine the polygons.

4. Save your edits.

Union Tool

The Union tool can be used to combine overlapping polygons from different layers into one feature class, which can help in managing overlaps and removing boundaries.

1. Open the Union tool (Analysis > Tools > Overlay > Union).

2. Select your merged feature class as the input.

3. Run the tool to create a new feature class where overlapping areas are handled.

By using these methods, you can remove the boundary line between your map units and create a more seamless merged feature class without relying solely on the Remove Overlap tool.

6. Remove Overlap

  • Go to the Analysis tab, click on Tools to open the Geoprocessing pane.
  • In the Geoprocessing pane, search for the Remove Overlap tool.
  • Input Features: Select the merged soil map unit feature class SoilMapUnits_Merge.
  • Output Feature Class: Specify the name and location for the output feature class where the results will be saved: SoilMapUnits_RemoveOverlap.
  • Method: Choose the appropriate method (Center LIne, Thiessen, or Grid Overlap) based on your analysis needs: Thiessen Overlap here.

Methods

Center Line Method

This method creates a line halfway between the overlapping polygon boundaries and splits the overlap along this line. Best suited for cases where you want a balanced distribution of the overlap area between the polygons. This is often used in scenarios where the overlap area is considered to equally belong to both polygons, such as administrative boundaries or property lines.

Thiessen Overlap Method

This method assigns the overlapping area to the closest polygon based on the perpendicular distance to the edges of the polygons. The resulting boundary is adjusted to maintain proximity relationships. Ideal for scenarios where the proximity to the original polygons is important. This method is commonly used in hydrological modeling, resource allocation, or any situation where the closest feature should dictate the boundary.

Grid Overlap Method

This method divides the overlap area using a grid-based approach. It systematically allocates sections of the overlap to the adjacent polygons based on a grid. Useful for large datasets where a systematic and computationally efficient approach is needed. It can also be beneficial when the overlap needs to be divided in a regular pattern, such as in environmental modeling or large-scale land cover studies.

When to Use Each Method?

Center Line Method

  • You need a fair and balanced division of the overlap area.
  • The overlap area should be shared equally.
  • Precision in maintaining the middle line is crucial.

Examples: Administrative boundaries, property lines, Urban planning boundaries.

Thiessen Overlap Method

  • The proximity of polygons is important for the analysis.
  • You want the closest polygon to "claim" the overlap area.
  • Maintaining natural boundaries is essential.

Examples: Hydrological studies, natural resource allocation, ecological or habitat boundaries.

Grid Overlap Method

  • You need a method that scales well with large datasets.
  • The overlap should be divided systematically.
  • Regular pattern division is acceptable or preferred.

Examples: Large-scale environmental studies, land cover and land use analysis, agricultural or forestry planning.

  • Click Run to execute the Remove Overlap process.

  • Verify the Output.

Merge tool or Remove Overlap tool?

When you use the Remove Overlap tool, the tool not only resolves the overlaps but also merges the polygons as part of the process. You can do both with one tool!

Practical Tips

Evaluate the Nature of Overlap

Understand the context of your dataset and the importance of proximity, fairness, and systematic division before choosing the method.

Test Different Methods

If unsure, run the different methods on a subset of your data to visually inspect the results and determine which method best fits your needs.

Document Your Process

Keep a record of the method used, the rationale behind the choice, and any adjustments made during the process. This documentation will be helpful for future reference and for ensuring consistency in your workflow.

7. Enhancing Soil Map Units

Now that we have a clean set of soil map units with resolved overlaps, the next step is to modify the symbology to display the soil names. By adjusting the symbology, we can create a more informative and visually appealing map that clearly identifies each soil types. To do this we'll use the field containing the soil names, complexes or associations (Field 2 obtained through our join operation) as the basis for the symbology. This will allow us to differentiate each soil unit by its specific type, making it easier to analyze and interpret the spatial distribution of soil types across the study area.

Key Considerations when Merging Shapefiles

Geometries

Ensure that the geometries (points, lines, polygons) of the feature classes you are merging are the same. You cannot merge a point feature class with a polygon feature class.

Coordinate Systems

All feature classes should have the same coordinate system. If they don't, reproject them to a common coordinate system before merging.

Attribute Fields

Ensure that the attribute fields (columns) you want to preserve are compatible. You may need to harmonize field names and data types. For instance, if one shapefile has a field named Soil_Type and another has SoilClass, you should rename them to match.

Field Mapping:

Use the "Field Map" option in the Merge tool to manage how fields from different feature classes are combined. You can choose to keep or discard certain fields and specify how values should be handled.

Tips and Tricks

Preparing Data

Clean and prepare your datasets before merging. This includes removing duplicates, correcting geometry errors, and ensuring attribute consistency.

Backup Data: Always create a backup of your original data before performing operations like merging.

Field Mapping

Utilize the field mapping feature in the Merge tool to control how attributes from different feature classes are handled. This allows you to merge fields with different names and resolve conflicts.

Using the Geoprocessing Tool

In ArcGIS Pro, use the Merge geoprocessing tool to combine feature classes. Configure the tool with the appropriate input datasets and specify the output feature class.

Quality Check

After merging, perform a quality check to ensure that the merged dataset has no geometry errors, attribute inconsistencies, or duplicate features.

Handling Large Datasets

For large datasets, consider using the Append tool if you need to add new features to an existing feature class incrementally.

Documenting Changes

Keep track of the changes and document the process. This includes noting which datasets were merged, any transformations applied, and how fields were mapped.

Check for Duplicates: After merging, check for and address any duplicate features or records, especially if the input datasets have overlapping spatial extents or identical features.

We hope that this article has been helpful! If you have any feedback or questions, please feel free to send us an email or connect with us for a chat. The NTGISC team is here to assist you further!


Resource (s):

https://pro.arcgis.com/en/pro-app/latest/tool-reference/data-management/merge.htm

Soil Survey Staff, Natural Resources Conservation Service, United States Department of Agriculture. Web Soil Survey. Available online at the following link: http://websoilsurvey.sc.egov.usda.gov/

https://websoilsurvey.nrcs.usda.gov/app/WebSoilSurvey.aspx

https://pro.arcgis.com/en/pro-app/latest/tool-reference/analysis/remove-overlap-multiple.htm

Did this answer your question? Thanks for the feedback There was a problem submitting your feedback. Please try again later.

Still need help? Contact Us Contact Us