alex.applications.PublicTransportInfoEN.data.preprocessing package

Submodules

alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual module

A script that creates a CSV file listing the places from INPUT_FILE, with STRING_SAME_FOR_ALL as the second column of every row. Unless the -c flag is set, the result is merged with an already existing OUTPUT_FILE.

Usage: ./compatibility_script_manual --name OUTPUT_FILE --main-place STRING_SAME_FOR_ALL --list INPUT_FILE [-c]

alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual.handle_compatibility(file_in, file_out, main_place, no_cache=False)
alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual.main()
alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual.read_prev_compatibility(filename)
alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual.save_set(output_file, output_set, separator=u'; ')
alex.applications.PublicTransportInfoEN.data.preprocessing.compatibility_script_manual.stick_place_in_front(place, list)
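
The merge behaviour described for this module can be illustrated with a minimal sketch. It is not the module's actual implementation: the function name merge_compatibility, the row layout "place; main_place", and the sorting of the output are assumptions made for illustration. Only the u'; ' separator is taken from the save_set signature above.

```python
def merge_compatibility(new_places, main_place, previous_rows=(), separator="; "):
    """Return sorted 'place<separator>main_place' rows merged with previous rows.

    Hypothetical helper: passing no previous_rows loosely mirrors the -c flag,
    where an existing OUTPUT_FILE is not merged in.
    """
    rows = set(previous_rows)  # rows assumed to come from an existing OUTPUT_FILE
    for place in new_places:
        place = place.strip()
        if place:  # skip blank lines
            rows.add(place + separator + main_place)
    return sorted(rows)


previous = ["Brooklyn; New York"]
places = ["Queens", "Brooklyn", ""]
for row in merge_compatibility(places, "New York", previous):
    print(row)
```

Because the rows are collected in a set, a place that already appears in the previous output is not duplicated.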

alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv module

A script that takes an MTA stops file, selects the important fields, and saves them (works mainly with GTFS data).

Usage:

./mta_to_csv.py [-m: main_city] [-o: output_file] stops.txt

alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.average_same_stops(same_stops)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.extract_fields(lines, header, main_city, skip_comments=True)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.get_column_index(header, caption, default)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.group_by_name(data)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.load_list(filename, skip_comments=True)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.main()
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.remove_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.remove_following_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.mta_to_csv.write_data(file_name, data)
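
As a rough illustration of what grouping stops by name and averaging them might look like, the sketch below reads a GTFS-style stops.txt and averages the coordinates of stops that share a name. The column names stop_name, stop_lat and stop_lon come from the GTFS specification. The function average_stops_by_name is hypothetical and does not reproduce the module's own group_by_name or average_same_stops.

```python
import csv
import io
from collections import defaultdict


def average_stops_by_name(stops_csv_text):
    """Group GTFS stops by name and average their coordinates (illustrative only)."""
    reader = csv.DictReader(io.StringIO(stops_csv_text))
    groups = defaultdict(list)
    for row in reader:
        coords = (float(row["stop_lat"]), float(row["stop_lon"]))
        groups[row["stop_name"]].append(coords)

    result = []
    for name in sorted(groups):
        coords = groups[name]
        lat = sum(c[0] for c in coords) / len(coords)
        lon = sum(c[1] for c in coords) / len(coords)
        result.append((name, lat, lon))
    return result


# Made-up sample data, not taken from a real MTA feed.
sample = """stop_id,stop_name,stop_lat,stop_lon
101,Main St,40.0,-74.0
102,Main St,41.0,-73.0
103,Broadway,40.5,-73.5
"""

for name, lat, lon in average_stops_by_name(sample):
    print(name, lat, lon)
```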

alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment module

A script that takes MTA stops, splits each stop name on special characters, and treats each resulting item as a street.

alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.average_same_stops(same_stops)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.extract_stops(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.get_column_index(header, caption, default)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.group_by_name(data)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.load_list(filename)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.main()
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.remove_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.remove_following_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.stops_to_streets_experiment.write_data(file_name, data)
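
The idea of splitting a stop name into street names can be sketched as follows. The set of separator characters ("/", "&" and "@") and the function name stop_to_streets are assumptions for illustration. The characters the script actually splits on are not documented here.

```python
import re

# Assumed separators; the real script may use a different set.
SEPARATORS = re.compile(r"[/&@]")


def stop_to_streets(stop_name):
    """Split a stop name on the assumed separators and drop empty pieces."""
    parts = (part.strip() for part in SEPARATORS.split(stop_name))
    return [part for part in parts if part]


print(stop_to_streets("Broadway/W 4 St"))
print(stop_to_streets("5 Av & 42 St"))
```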

alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv module

A script that takes a US cities file (city, state_code) and a state-codes file and joins them.

Usage:

./us_cities_to_csv.py [-o: output_file] cities.txt state-codes.txt

alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.average_same_city(same_stops)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.extract_fields(lines, header, state_dictionary, skip_comments=True)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.get_column_index(header, caption, default)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.group_by_city_and_state(data)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.load_list(filename, skip_comments=True)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.load_state_code_dict(file_state_codes, skip_comments=True)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.main()
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.remove_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.remove_following_duplicities(lines)
alex.applications.PublicTransportInfoEN.data.preprocessing.us_cities_to_csv.write_data(file_name, data)
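
A minimal sketch of joining cities with state names is given below. The input layouts are assumptions: tab-separated "city, state_code" lines for the cities and tab-separated "state_code, state_name" lines for the state codes, with "#" marking comment lines. The helper names load_state_names and join_cities_with_states are hypothetical and do not correspond to the functions listed above.

```python
def load_state_names(lines, separator="\t"):
    """Build a state_code -> state_name dictionary, skipping blanks and comments."""
    states = {}
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        code, name = line.split(separator, 1)
        states[code.strip()] = name.strip()
    return states


def join_cities_with_states(city_lines, states, separator="\t"):
    """Replace each city's state code with the state name when it is known."""
    joined = []
    for line in city_lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        city, code = line.rsplit(separator, 1)
        code = code.strip()
        # Fall back to the raw code if it is missing from the dictionary.
        joined.append((city.strip(), states.get(code, code)))
    return joined


states = load_state_names(["NY\tNew York", "# comment", "CA\tCalifornia"])
cities = ["Albany\tNY", "Fresno\tCA", "Nowhere\tZZ"]
for city, state in join_cities_with_states(cities, states):
    print(city, state, sep=", ")
```

Keeping the unknown code instead of dropping the row is a design choice of this sketch, so that unmatched cities remain visible in the output.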

Module contents