SmeetPatel committed on
Commit
e261135
·
verified ·
1 Parent(s): ccf29cc

explanation and future improvements

Files changed (1)
  1. app.py +14 -8
app.py CHANGED
@@ -37,11 +37,12 @@ st.subheader("Visualization 1: Distribution of Building Usage")
  st.altair_chart(usage_bar, use_container_width=True)
  st.text("""
  This bar chart shows the distribution of buildings based on their primary usage type. The following design choices were made:
- - Horizontal bars make it easier to read the usage categories.
- - The bars are sorted by count to highlight the most common building usages.
+ - Horizontal Layout: Horizontal bars were used for better readability, especially since Usage Description values are text-heavy. This layout prevents truncation and allows easier comparison.
+ - Sorting the categories by count, in descending order, ensures that the most common usage types are immediately visible, aiding prioritization of insights.
  - Each bar is uniquely colored for visual distinction.
  Future Improvements:
- - Adding filters for regions or building statuses could provide more detailed insights.
+ - Adding filters based on Region, Bldg Status, or other columns would allow users to view the distribution of building usage for specific subsets of data.
+ - It can also incorporate a feature to drill down into specific categories (e.g., Usage Description) and see detailed statistics for subcategories.
  """)

  # Visualization 2: Relationship Between Year Constructed and Square Footage
@@ -84,12 +85,17 @@ st.altair_chart(scatter_plot, use_container_width=True)
  st.text("""
  This scatter plot highlights the relationship between the square footage of a building and its total floors.
  Design choices include:
- - Square Footage is represented on the x-axis, as it provides a numeric measure of building size.
- - Total Floors is represented on the y-axis to show the height distribution of buildings.
- - Points are color-coded by building status to differentiate operational buildings.
- - Tooltips provide additional information for exploration.
+ - Scatter Plot Representation: A scatter plot was chosen as it is ideal for exploring relationships between two continuous variables (Square Footage and Total Floors). This allows patterns or clusters to be identified, such as whether larger buildings tend to have more floors.
+ - Color Coding by Building Status: Points were color-coded by Bldg Status to differentiate operational and non-operational buildings. This helps identify trends within specific categories of buildings.
+ - Dynamic Domain: The x-axis (Square Footage) and y-axis (Total Floors) were scaled dynamically to accommodate the full range of data, ensuring no points were excluded.
+ - Interactive Tooltips: Tooltips were added to provide detailed information about each point, such as the building’s name, location, size, and total floors. This makes the visualization interactive and detailed.
+ - Point Size: A fixed point size ensures clarity while maintaining focus on relationships between variables without visual clutter.
  Future Improvements:
- - Adding filters for building usage or region could enhance the analysis.
+
+ - Adding a Third Variable: Introduce point size encoding to represent a third variable, such as Floors Above Grade, for richer insights.
+ - Filters for Building Usage or Region: Enable filtering by Usage Description or Region to analyze specific subsets of data, like residential buildings in a particular county.
+ - Trend Line or Clustering: Add a regression line or clustering to help identify trends or group similar buildings based on size and floors.
+ - Zoom and Pan Functionality: Incorporating zoom and pan features would improve navigation for datasets with wide ranges in square footage.
  """)