0% found this document useful (0 votes)
26 views4 pages

Emirates Line Web Scraper Function

The document outlines a web scraping function for the Emirates Line schedule using Selenium, which includes error handling and logging. It provides routes for starting the scraping process, checking its status, and retrieving results. The function manages global state variables to track the scraping status, results, and any errors encountered during execution.
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT or PDF, or read online on Scribd

Topics covered

  • Scraping Completion,
  • User Interface Interaction,
  • User Input Simulation,
  • Scraping Locking Mechanism,
  • Input Suggestions,
  • API Endpoints,
  • Result Processing,
  • JavaScript Execution,
  • XPath Selection,
  • Scraping Techniques
0% found this document useful (0 votes)
26 views4 pages

Emirates Line Web Scraper Function

The document outlines a web scraping function for the Emirates Line schedule using Selenium, which includes error handling and logging. It provides routes for starting the scraping process, checking its status, and retrieving results. The function manages global state variables to track the scraping status, results, and any errors encountered during execution.
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as TXT or PDF, or read online on Scribd

Topics covered

  • Scraping Completion,
  • User Interface Interaction,
  • User Input Simulation,
  • Scraping Locking Mechanism,
  • Input Suggestions,
  • API Endpoints,
  • Result Processing,
  • JavaScript Execution,
  • XPath Selection,
  • Scraping Techniques

# --- App / Selenium Imports --- # << ADDED
# NOTE(review): module paths below were destroyed by document extraction
# (replaced with "[Link]") and have been reconstructed from the names used
# in the code (By, WebDriverWait, EC, the exception classes). The Flask
# `app` object itself is presumably created elsewhere in this file.
import logging
import threading
import time

from flask import jsonify

from selenium import webdriver
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.common.exceptions import (
    TimeoutException,
    NoSuchElementException,
    WebDriverException,
)
from webdriver_manager.chrome import ChromeDriverManager  # Optional: auto-manages chromedriver

# --- Web Scraper Function --- # << ADDED


def run_emirates_scrape():
"""
Performs the web scraping task for Emirates Line schedule.
Updates global state variables upon completion or error.
"""
global scraper_status, scraper_result, scraper_error, scraper_lock

[Link]("Starting Emirates Line scraping task...")


driver = None
try:
options = [Link]()
# options.add_argument("--disable-gpu")
# options.add_argument("--headless") # Enable for production/server
environments
# options.add_argument("--no-sandbox") # Often needed in
containerized/headless environments
# options.add_argument("--disable-dev-shm-usage") # Overcomes limited
resource problems

# Use WebDriverManager or specify path directly


try:
# Attempt to use WebDriverManager first
service =
[Link](ChromeDriverManager().install())
driver = [Link](service=service, options=options)
[Link]("ChromeDriver started using WebDriverManager.")
except Exception as wdm_error:
[Link](f"WebDriverManager failed ({wdm_error}). Falling back
to default ChromeDriver path.")
# Fallback if WebDriverManager fails or isn't used
driver = [Link](options=options)
[Link]("ChromeDriver started using default path.")

[Link]("[Link]
wait = WebDriverWait(driver, 20) # Increased wait time

[Link]("Page loaded. Waiting for elements...")


# driver.save_screenshot("debug_page_loaded.png") # Optional debug
screenshot

# --- Origin Port ---


origin_port = [Link](EC.visibility_of_element_located(([Link],
"originPort")))
[Link]("Origin port input found.")
text = "Je"
for ch in text:
origin_port.send_keys(ch)
[Link](1) # Shorter delay might work, adjust if needed

[Link]("Typed 'Je'. Waiting for origin suggestions...")


# Wait for the dropdown suggestion and click
origin_suggestion = [Link](EC.element_to_be_clickable(
([Link], "//li[contains(@class,
'ui-menu-item')]/div[contains(text(),'JEBEL ALI')]"))) # More specific XPath
[Link](f"Found origin suggestion: {origin_suggestion.text}")
origin_suggestion.click()
[Link]("Clicked origin suggestion.")
[Link](0.5) # Small pause after click

# --- Destination Port ---


destination_port = [Link](EC.visibility_of_element_located(([Link],
"destinationPort")))
[Link]("Destination port input found.")
text1 = "Mu"
for ch in text1:
destination_port.send_keys(ch)
[Link](1) # Shorter delay

[Link]("Typed 'Mu'. Waiting for destination suggestions...")


# Wait for the dropdown suggestion and click
dest_suggestion = [Link](EC.element_to_be_clickable(
([Link], "//li[contains(@class,
'ui-menu-item')]/div[contains(text(),'MUNDRA, INDIA')]"))) # More specific XPath
[Link](f"Found destination suggestion: {dest_suggestion.text}")
dest_suggestion.click()
[Link]("Clicked destination suggestion.")
[Link](0.5) # Small pause

# --- Click Search ---


search_button = [Link](EC.element_to_be_clickable(
([Link], "//button[contains(@class, 'primary-btn') and
contains(text(), 'Search')]")))
[Link]("Search button found.")
search_button.click()
[Link]("Clicked search button.")

# --- Wait for and Extract Results ---


[Link]("Waiting for schedule results table...")
schedule_div = [Link](EC.presence_of_element_located(
(By.CLASS_NAME, "schedule-viewer-table-main")))
[Link]("Results table located.")

# Get only the visible text using JavaScript for cleaner output
visible_text = driver.execute_script(
"return arguments[0].innerText || arguments[0].textContent;",
schedule_div
)
[Link]("Extracted visible text from results table.")
# driver.save_screenshot("debug_results_found.png") # Optional debug

# --- Update Global State (Success) ---


with scraper_lock:
scraper_result = visible_text.strip() if visible_text else "No schedule
data found."
scraper_status = "completed"
scraper_error = None
[Link]("Scraping completed successfully.")

except TimeoutException as te:


[Link](f"Scraping timed out waiting for element: {te}",
exc_info=True)
# driver.save_screenshot("debug_timeout_error.png") # Optional debug
with scraper_lock:
scraper_error = f"Timeout waiting for element: {str(te).splitlines()
[0]}"
scraper_status = "error"
scraper_result = None
except NoSuchElementException as nse:
[Link](f"Scraping failed: Element not found: {nse}", exc_info=True)
# driver.save_screenshot("debug_notfound_error.png") # Optional debug
with scraper_lock:
scraper_error = f"Element not found: {str(nse).splitlines()[0]}"
scraper_status = "error"
scraper_result = None
except WebDriverException as wde:
[Link](f"WebDriver error during scraping: {wde}", exc_info=True)
# driver.save_screenshot("debug_webdriver_error.png") # Optional debug
with scraper_lock:
scraper_error = f"Browser/Driver error: {str(wde).splitlines()[0]}"
scraper_status = "error"
scraper_result = None
except Exception as e:
[Link](f"Unexpected error during scraping: {e}", exc_info=True)
# driver.save_screenshot("debug_unexpected_error.png") # Optional debug
with scraper_lock:
scraper_error = f"An unexpected error occurred: {str(e)}"
scraper_status = "error"
scraper_result = None
finally:
if driver:
try:
[Link]()
[Link]("WebDriver closed.")
except Exception as quit_e:
[Link](f"Error closing WebDriver: {quit_e}")
# Ensure status reflects completion or error even if finally block runs
before update
with scraper_lock:
if scraper_status == "running": # If it failed before setting status
if scraper_error is None: # Check if error was already set
scraper_error = "Scraping process ended unexpectedly."
scraper_status = "error"
scraper_result = None
[Link]("Scraping status set to 'error' in finally block.")

# --- Scraper Routes --- # << ADDED

@[Link]('/scrape/start', methods=['POST'])
def start_scrape():
"""Starts the Emirates Line scraping process in a background thread."""
global scraper_status, scraper_result, scraper_error, scraper_thread,
scraper_lock

with scraper_lock:
if scraper_status == "running":
[Link]("Scrape start requested, but already running.")
return jsonify({'success': False, 'message': 'Scraping process is
already running.'}), 409 # Conflict

# Reset state and start


scraper_status = "running"
scraper_result = None
scraper_error = None
[Link]("Starting new scraper thread.")
# Important: Pass the function to run, not the result of calling it
scraper_thread = [Link](target=run_emirates_scrape, daemon=True)
scraper_thread.start()

return jsonify({'success': True, 'message': 'Scraping process started.'})

@[Link]('/scrape/status', methods=['GET'])
def get_scrape_status():
"""Returns the current status of the scraping process."""
global scraper_status, scraper_error, scraper_lock
with scraper_lock:
response = {
'status': scraper_status,
'error': scraper_error
}
# [Link](f"Sending scrape status: {response}") # Can be verbose
return jsonify(response)

@[Link]('/scrape/results', methods=['GET'])
def get_scrape_results():
"""Returns the results of the last completed scrape."""
global scraper_status, scraper_result, scraper_error, scraper_lock
with scraper_lock:
if scraper_status == "completed":
[Link]("Sending completed scrape results.")
return jsonify({'status': 'completed', 'results': scraper_result})
elif scraper_status == "error":
[Link]("Sending scrape error details.")
return jsonify({'status': 'error', 'error': scraper_error})
elif scraper_status == "running":
[Link]("Scrape results requested, but still running.")
return jsonify({'status': 'running', 'message': 'Scraping is still in
progress.'})
else: # idle
[Link]("Scrape results requested, but no scrape has been run
yet.")
return jsonify({'status': 'idle', 'message': 'Scraping has not been
started yet.'})

You might also like