Different size of matrix using create_float_source for raw and adjusted data #218

kamwal · 2022-05-05T09:58:38Z

I have found a differences in size of the matrix generated by the ds.argo.create_float_source code for generating the Wong matrix .mat for OWC analysis. The differences appear for .mat matrix for raw data and adjusted data.

    ds.argo.create_float_source(force='raw')
    ds.argo.create_float_source(force='adjusted')

The equivalent matlab code to generate float source is giving the same size of output for raw and adjusted data https://github.com/euroargodev/dm_floats/blob/master/src/ow_source/create_float_source.m

The WMO float examples where the issue have been detected: 3901928, 3900797,3900799
The mismatch between size of matrices including raw and adjusted data lead to problems in extracting differences and comparing data during checks of the quality of adjusted data.

I am using the argopy v0.1.11 version.

The text was updated successfully, but these errors were encountered:

gmaze · 2022-05-16T06:32:48Z

Hi @kamwal
Could you please share here the files generated with the float source Matlab code ?

gmaze · 2022-05-16T06:57:19Z

@kamwal
I looked at the output for WMO= 3901928
The difference is in the selection of data, or not, from the last 8 profiles:

And if I look to the netcdf file content with:

from argopy import DataFetcher as ArgoDataFetcher
WMO = 3901928
argo_loader = ArgoDataFetcher(src='gdac', cache=True, mode='expert', dataset='phy').float(WMO)
ds = argo_loader.load().data
dsp =  ds.argo.point2profile()
dsp.where(dsp['CYCLE_NUMBER']==164, drop=True)

I see the data mode to Delayed and the adjusted salinity full of NaNs with QC=4,
that's why the raw=adjusted option do not select these profiles
So I guess the question is more why the Matlab code select these ?

kamwal · 2022-05-16T08:14:10Z

3901928.zip

Thanks for looking at this.

I think it is done to don't create any issues with a mismatch of the size of the matrix for all parameters. Sometimes for some floats the QC =4 is applied not to all parameters (PRES, SAL, TEMP) like here, but only to one parameter like PSAL.
Having the same size of matrices for raw and adjusted is easier for further comparison of these two datasets.

gmaze · 2022-05-16T09:35:15Z

After discussion with @cabanesc , this appears to be motivated by the post analysis use of the .mat source files: the D netcdf files are created for profiles in the source files !
Hence no D files for profiles not reported in the source file (even if full of NaNs).

I don't know how OWC handle this, but we could fix argopy to make sure to report as many profiles as before all the filtering, and fields would be full of NaNs.

kamwal · 2022-06-08T09:16:04Z

Yes, thanks it would be very helpful

gmaze · 2022-06-08T10:22:29Z

@kamwal note that I have no idea when I'll be able to fix this ...

github-actions · 2022-09-07T10:10:27Z

This issue was marked as staled automatically because it has not seen any activity in 90 days

gmaze · 2022-11-02T09:15:03Z

I don't know how OWC handle this, but we could fix argopy to make sure to report as many profiles as before all the filtering, and fields would be full of NaNs.

Although after some thoughts I'm not sure anymore if this is the way to go, since this approach is messing up matrix content with different file uses (OWC analysis vs D file production). Basically I'm cold feet with reproducing a flawed workflow based on the Matlab software.

github-actions · 2023-01-31T10:06:04Z

This issue was marked as staled automatically because it has not seen any activity in 90 days

github-actions · 2024-04-13T10:05:46Z

This issue was closed automatically because it has not seen any activity in 365 days

kamwal added invalid This doesn't seem right argo-core About core variables (P, T, S) argo-deep About deep variables (anything below 2000db) labels May 5, 2022

gmaze mentioned this issue May 16, 2022

create_float_source code generate many nan values in matrix files #219

Open

gmaze added this to the Go from alpha to beta milestone Jun 8, 2022

github-actions bot added the stale No activity over the last 90 days label Sep 7, 2022

gmaze removed the stale No activity over the last 90 days label Sep 23, 2022

gmaze added the forQCexpert Argo QC expertise is required label Nov 2, 2022

github-actions bot added the stale No activity over the last 90 days label Jan 31, 2023

github-actions bot added the closed-as-stale label Apr 13, 2024

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Apr 13, 2024

gmaze removed the closed-as-stale label Apr 15, 2024

gmaze reopened this Apr 15, 2024

github-actions bot removed the stale No activity over the last 90 days label Apr 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Different size of matrix using create_float_source for raw and adjusted data #218

Different size of matrix using create_float_source for raw and adjusted data #218

kamwal commented May 5, 2022

gmaze commented May 16, 2022

gmaze commented May 16, 2022

kamwal commented May 16, 2022

gmaze commented May 16, 2022

kamwal commented Jun 8, 2022

gmaze commented Jun 8, 2022

github-actions bot commented Sep 7, 2022

gmaze commented Nov 2, 2022

github-actions bot commented Jan 31, 2023

github-actions bot commented Apr 13, 2024

Different size of matrix using create_float_source for raw and adjusted data #218

Different size of matrix using create_float_source for raw and adjusted data #218

Comments

kamwal commented May 5, 2022

gmaze commented May 16, 2022

gmaze commented May 16, 2022

kamwal commented May 16, 2022

gmaze commented May 16, 2022

kamwal commented Jun 8, 2022

gmaze commented Jun 8, 2022

github-actions bot commented Sep 7, 2022

gmaze commented Nov 2, 2022

github-actions bot commented Jan 31, 2023

github-actions bot commented Apr 13, 2024