Package : via_DMS¶
Module : DMSDatabase¶
Module : FileOperations¶
-
class
src.data_access.via_DMS.FileOperations.
FileOperations
(analysis_jobs=None, parent_folder=None, job_info=None)[source]¶ Bases:
object
Grab locations of the MSGF+ & MASIC analysis tools using analysis_jobs object
-
parse_fileserverpath_to_web_url
(file_server_path)[source]¶ Converts Windows FileSever path to webURL. :param file_server_path: windows server file path. :return:
-
use_df
(df)[source]¶ Called for each dataset in the dataFrame! :param df: reference to analysis_jobs object. :return:
-
get_files
(**kw)¶
-
Module : Input¶
Module : Query¶
-
class
src.data_access.via_DMS.Query.
Query
[source]¶ Bases:
object
SQL queries to access data from DMS
-
DATASET_MSFG
= " SELECT A.Dataset_ID,\n A.MSGFPlusJob,\n B.MasicJob\n FROM ( SELECT Dataset_ID,\n Job AS MSGFPlusJob\n FROM DMS_Data_Package.dbo.V_Data_Package_Analysis_Jobs_List_Report\n WHERE ID = {} AND\n Tool LIKE 'msgf%' ) A\n LEFT OUTER JOIN\n ( SELECT Dataset_ID,\n Job AS MasicJob\n FROM DMS_Data_Package.dbo.V_Data_Package_Analysis_Jobs_List_Report\n WHERE ID = {} AND\n Tool LIKE 'masic%' ) B\n ON A.Dataset_ID = B.Dataset_ID\n "¶
-
MSGF_loc
= 'SELECT JobNum As MSGFPlusJob, [Data Folder Link] As MSGFplus_loc\n FROM V_Analysis_Job_Detail_Report_2\n WHERE JobNum IN ({})'¶
-
DATASET_MASIC
= " SELECT Dataset_ID, Max(Job) As NewestMasicJob\n FROM V_Analysis_Job_List_Report_2\n WHERE Dataset_ID In ({})\n And Tool Like 'masic%'\n Group by Dataset_ID"¶
-
MASIC_loc
= 'SELECT JobNum AS NewestMasicJob , [Results Folder Path] As MASIC_loc\n FROM V_Analysis_Job_Detail_Report_2\n WHERE JobNum IN ({})'¶
-
DATASET
= " SELECT Dataset_ID, Job As MSGFPlusJob, [Results Folder Path] As MSGFplus_loc\n FROM V_Analysis_Job_List_Report_2\n WHERE Job IN ( SELECT Max(Job)\n FROM V_Analysis_Job_List_Report_2\n WHERE [Tool] LIKE 'msgf%' AND\n Dataset_ID IN ({})\n GROUP BY dataset_id )"¶
-
MSGF
= 'SELECT Dataset_ID, Job As MSGFPlusJob,[Results Folder Path] As MSGFplus_loc \n FROM V_Analysis_Job_List_Report_2\n WHERE Job IN ({})'¶
-
JOB_INFO
= 'SELECT Job, Dataset, Experiment, OrganismDBName, ProteinCollectionList, ParameterFileName\n FROM V_Analysis_Job_Export\n Where Job In ({}) \n Order By Dataset, job'¶
-
Module : QueryBuilder¶
-
class
src.data_access.via_DMS.QueryBuilder.
QueryBuilder
(user_input=None, storage=None, project_name=None)[source]¶ Bases:
object
Build MS-SQl Queries.
Execute them
- create a dataFrame that holds all Information
Dataset_ID | MSGFPlusJob | Data Folder Link | NewestMasicJob | Results Folder Path |
-
save_to_disk
(data, data_path, msgf_job_list, id)[source]¶ - Parameters
data –
data_path –
msgf_job_list –
id –
- Returns
-
start_with_datapackage_id
(id)[source]¶ Given a ID —-Find out the Dataset_ID , MSGFPlusJob ——–Using MSGFPlusJob, findout “Data Folder Link” ——–Using Dataset_ID, findout NewestMasicJob ————Using NewestMasicJob findout “Results Folder Path” Merge results to create “analysis_jobs”.
- Parameters
id – datapackage_id
- Returns
-
start_with_dataset_ids
(id_list)[source]¶ Given set of dataset-IDs —-findout MSGFPlusJob, “Results Folder Path” ——–Using Dataset_ID, findout NewestMasicJob ————Using NewestMasicJob findout “Results Folder Path” Merge results to create “analysis_jobs”.
- Parameters
id_list – set of dataset-IDs
- Returns