Inheritance diagram for lsst.meas.base.diaCalculation.DiaObjectCalculationTask:

Public Member Functions
	__init__ (self, plugMetadata=None, **kwargs)

	initializePlugins (self)

	run (self, diaObjectCat, diaSourceCat, updatedDiaObjectIds, filterNames)

	callCompute (self, diaObjectCat, diaSourceCat, updatedDiaObjectIds, filterNames)

Public Attributes
	plugMetadata

	plugins

	outputCols

	executionDict

	log

Static Public Attributes
	ConfigClass = DiaObjectCalculationConfig

Protected Member Functions
	_validatePluginCols (self, plug)

	_initialize_dia_object (self, objId)

Static Protected Attributes
str	_DefaultName = "diaObjectCalculation"

Detailed Description

Run plugins which operate on a catalog of DIA sources.

This task facilitates running plugins which will operate on a source
catalog. These plugins may do things such as classifying an object based
on source record entries inserted during a measurement task.

This task differs from CatalogCaculationTask in the following ways:

-No multi mode is available for plugins. All plugins are assumed to run
 in single mode.

-Input and output catalog types are assumed to be `pandas.DataFrames` with
 columns following those used in the Apdb.

-No schema argument is passed to the plugins. Each plugin specifies
 output columns and required inputs.

Parameters
----------
plugMetaData : `lsst.daf.base.PropertyList` or `None`
    Will be modified in-place to contain metadata about the plugins being
    run. If `None`, an empty `~lsst.daf.base.PropertyList` will be
    created.
**kwargs
    Additional arguments passed to the superclass constructor.

Notes
-----
Plugins may either take an entire catalog to work on at a time, or work on
individual records.

Definition at line 172 of file diaCalculation.py.

Constructor & Destructor Documentation

◆ init()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.__init__	(		self,
			plugMetadata = None,
		**	kwargs )

Reimplemented from lsst.meas.base.catalogCalculation.CatalogCalculationTask.

Definition at line 207 of file diaCalculation.py.

    def __init__(self, plugMetadata=None, **kwargs):
        lsst.pipe.base.Task.__init__(self, **kwargs)
        if plugMetadata is None:
            plugMetadata = lsst.daf.base.PropertyList()
        self.plugMetadata = plugMetadata
        self.plugins = PluginMap()
        self.outputCols = []
 
        self.initializePlugins()
 

Member Function Documentation

◆ _initialize_dia_object()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask._initialize_dia_object	(		self,
			objId )

protected

Create a new DiaObject with values required to be initialized by the
Apdb.

Parameters
----------
objid : `int`
    ``diaObjectId`` value for the of the new DiaObject.

Returns
-------
diaObject : `dict`
    Newly created DiaObject with keys:

    ``diaObjectId``
        Unique DiaObjectId (`int`).
    ``pmParallaxNdata``
        Number of data points used for parallax calculation (`int`).
    ``nearbyObj1``
        Id of the a nearbyObject in the Object table (`int`).
    ``nearbyObj2``
        Id of the a nearbyObject in the Object table (`int`).
    ``nearbyObj3``
        Id of the a nearbyObject in the Object table (`int`).
    ``?_psfFluxNdata``
        Number of data points used to calculate point source flux
        summary statistics in each bandpass (`int`).

Definition at line 476 of file diaCalculation.py.

    def _initialize_dia_object(self, objId):
        """Create a new DiaObject with values required to be initialized by the
        Apdb.
 
        Parameters
        ----------
        objid : `int`
            ``diaObjectId`` value for the of the new DiaObject.
 
        Returns
        -------
        diaObject : `dict`
            Newly created DiaObject with keys:
 
            ``diaObjectId``
                Unique DiaObjectId (`int`).
            ``pmParallaxNdata``
                Number of data points used for parallax calculation (`int`).
            ``nearbyObj1``
                Id of the a nearbyObject in the Object table (`int`).
            ``nearbyObj2``
                Id of the a nearbyObject in the Object table (`int`).
            ``nearbyObj3``
                Id of the a nearbyObject in the Object table (`int`).
            ``?_psfFluxNdata``
                Number of data points used to calculate point source flux
                summary statistics in each bandpass (`int`).
        """
        new_dia_object = {"diaObjectId": objId,
                          "pmParallaxNdata": 0,
                          "nearbyObj1": 0,
                          "nearbyObj2": 0,
                          "nearbyObj3": 0}
        for f in ["u", "g", "r", "i", "z", "y"]:
            new_dia_object["%s_psfFluxNdata" % f] = 0
        return new_dia_object

◆ _validatePluginCols()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask._validatePluginCols	(		self,
			plug )

protected

Assert that output columns are not duplicated and input columns
exist for dependent plugins.

Parameters
----------
plug : `lsst.ap.association.DiaCalculationPlugin`
    Plugin to test for output collisions and input needs.

Definition at line 247 of file diaCalculation.py.

    def _validatePluginCols(self, plug):
        """Assert that output columns are not duplicated and input columns
        exist for dependent plugins.
 
        Parameters
        ----------
        plug : `lsst.ap.association.DiaCalculationPlugin`
            Plugin to test for output collisions and input needs.
        """
        for inputName in plug.inputCols:
            if inputName not in self.outputCols:
                errorTuple = (plug.name, plug.getExecutionOrder(),
                              inputName)
                raise ValueError(
                    "Plugin, {} with execution order {} requires DiaObject "
                    "column {} to exist. Check the execution order of the "
                    "plugin and make sure it runs after a plugin creating "
                    "the column is run.".format(*errorTuple))
        for outputName in plug.outputCols:
            if outputName in self.outputCols:
                errorTuple = (plug.name, plug.getExecutionOrder(),
                              outputName)
                raise ValueError(
                    "Plugin, {} with execution order {} is attempting to "
                    "output a column {}, however the column is already being "
                    "produced by another plugin. Check other plugins for "
                    "collisions with this one.".format(*errorTuple))
            else:
                self.outputCols.append(outputName)
 

◆ callCompute()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.callCompute	(	self,
		diaObjectCat,
		diaSourceCat,
		updatedDiaObjectIds,
		filterNames )

Run each of the plugins on the catalog.

For catalog column names see the lsst.cat schema definitions for the
DiaObject and DiaSource tables (http://github.com/lsst/cat).

Parameters
----------
diaObjectCat : `pandas.DataFrame`
    DiaObjects to update values of and append new objects to. DataFrame
    should be indexed on "diaObjectId"
diaSourceCat : `pandas.DataFrame`
    DiaSources associated with the DiaObjects in diaObjectCat.
    DataFrame must be indexed on
    ["diaObjectId", "band", "diaSourceId"]`
updatedDiaObjectIds : `numpy.ndarray`
    Integer ids of the DiaObjects to update and create.
filterNames : `list` of `str`
    List of string names of filters to be being processed.

Returns
-------
returnStruct : `lsst.pipe.base.Struct`
    Struct containing:

    ``diaObjectCat``
        Full set of DiaObjects including both un-updated and
        updated/new DiaObjects (`pandas.DataFrame`).
    ``updatedDiaObjects``
        Catalog of DiaObjects  that were updated or created by this
        task (`pandas.DataFrame`).

Raises
------
KeyError
    Raises if `pandas.DataFrame` indexing is not properly set.

Reimplemented from lsst.meas.base.catalogCalculation.CatalogCalculationTask.

Definition at line 346 of file diaCalculation.py.

                    filterNames):
        """Run each of the plugins on the catalog.
 
        For catalog column names see the lsst.cat schema definitions for the
        DiaObject and DiaSource tables (http://github.com/lsst/cat).
 
        Parameters
        ----------
        diaObjectCat : `pandas.DataFrame`
            DiaObjects to update values of and append new objects to. DataFrame
            should be indexed on "diaObjectId"
        diaSourceCat : `pandas.DataFrame`
            DiaSources associated with the DiaObjects in diaObjectCat.
            DataFrame must be indexed on
            ["diaObjectId", "band", "diaSourceId"]`
        updatedDiaObjectIds : `numpy.ndarray`
            Integer ids of the DiaObjects to update and create.
        filterNames : `list` of `str`
            List of string names of filters to be being processed.
 
        Returns
        -------
        returnStruct : `lsst.pipe.base.Struct`
            Struct containing:
 
            ``diaObjectCat``
                Full set of DiaObjects including both un-updated and
                updated/new DiaObjects (`pandas.DataFrame`).
            ``updatedDiaObjects``
                Catalog of DiaObjects  that were updated or created by this
                task (`pandas.DataFrame`).
 
        Raises
        ------
        KeyError
            Raises if `pandas.DataFrame` indexing is not properly set.
        """
        # DiaObjects will be updated in place.
        diaObjectsToUpdate = diaObjectCat.loc[updatedDiaObjectIds, :]
        self.log.info("Calculating summary stats for %i DiaObjects",
                      len(diaObjectsToUpdate))
 
        updatingDiaSources = diaSourceCat.loc[updatedDiaObjectIds, :]
        diaSourcesGB = updatingDiaSources.groupby(level=0)
        for runlevel in sorted(self.executionDict):
            for plug in self.executionDict[runlevel].single:
                if plug.needsFilter:
                    continue
                for updatedDiaObjectId in updatedDiaObjectIds:
 
                    # Sub-select diaSources associated with this diaObject.
                    objDiaSources = updatingDiaSources.loc[updatedDiaObjectId]
 
                    # Sub-select on diaSources observed in the current filter.
                    with CCContext(plug, updatedDiaObjectId, self.log):
                        # We feed the catalog we need to update and the id
                        # so as to get a few into the catalog and not a copy.
                        # This updates the values in the catalog.
                        plug.calculate(diaObjects=diaObjectsToUpdate,
                                       diaObjectId=updatedDiaObjectId,
                                       diaSources=objDiaSources,
                                       filterDiaSources=None,
                                       band=None)
            for plug in self.executionDict[runlevel].multi:
                if plug.needsFilter:
                    continue
                with CCContext(plug, diaObjectsToUpdate, self.log):
                    plug.calculate(diaObjects=diaObjectsToUpdate,
                                   diaSources=diaSourcesGB,
                                   filterDiaSources=None,
                                   band=None)
 
        for band in filterNames:
            try:
                updatingFilterDiaSources = updatingDiaSources.loc[
                    (slice(None), band), :
                ]
            except KeyError:
                self.log.warning("No DiaSource data with fitler=%s. "
                                 "Continuing...", band)
                continue
            # Level=0 here groups by diaObjectId.
            filterDiaSourcesGB = updatingFilterDiaSources.groupby(level=0)
 
            for runlevel in sorted(self.executionDict):
                for plug in self.executionDict[runlevel].single:
                    if not plug.needsFilter:
                        continue
                    for updatedDiaObjectId in updatedDiaObjectIds:
 
                        # Sub-select diaSources associated with this diaObject.
                        objDiaSources = updatingDiaSources.loc[updatedDiaObjectId]
 
                        # Sub-select on diaSources observed in the current filter.
                        try:
                            filterObjDiaSources = objDiaSources.loc[band]
                        except KeyError:
                            self.log.warning(
                                "DiaObjectId={updatedDiaObjectId} has no "
                                "DiaSources for filter=%s. "
                                "Continuing...", band)
                        with CCContext(plug, updatedDiaObjectId, self.log):
                            # We feed the catalog we need to update and the id
                            # so as to get a few into the catalog and not a copy.
                            # This updates the values in the catalog.
                            plug.calculate(diaObjects=diaObjectsToUpdate,
                                           diaObjectId=updatedDiaObjectId,
                                           diaSources=objDiaSources,
                                           filterDiaSources=filterObjDiaSources,
                                           band=band)
                for plug in self.executionDict[runlevel].multi:
                    if not plug.needsFilter:
                        continue
                    with CCContext(plug, diaObjectsToUpdate, self.log):
                        plug.calculate(diaObjects=diaObjectsToUpdate,
                                       diaSources=diaSourcesGB,
                                       filterDiaSources=filterDiaSourcesGB,
                                       band=band)
        # Need to store the newly updated diaObjects directly as the editing
        # a view into diaObjectsToUpdate does not update the values of
        # diaObjectCat.
        diaObjectCat.loc[updatedDiaObjectIds, :] = diaObjectsToUpdate
        return lsst.pipe.base.Struct(
            diaObjectCat=diaObjectCat,
            updatedDiaObjects=diaObjectsToUpdate)
 

◆ initializePlugins()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.initializePlugins ( self )

Initialize the plugins according to the configuration.

Reimplemented from lsst.meas.base.catalogCalculation.CatalogCalculationTask.

Definition at line 217 of file diaCalculation.py.

    def initializePlugins(self):
        """Initialize the plugins according to the configuration.
        """
 
        pluginType = namedtuple('pluginType', 'single multi')
        self.executionDict = {}
        # Read the properties for each plugin. Allocate a dictionary entry for
        # each run level. Verify that the plugins are above the minimum run
        # level for an catalogCalculation plugin. For each run level, the
        # plugins are sorted into either single record, or multi record groups
        # to later be run appropriately
        for executionOrder, name, config, PluginClass in sorted(self.config.plugins.apply()):
            if executionOrder not in self.executionDict:
                self.executionDict[executionOrder] = pluginType(single=[], multi=[])
            if PluginClass.getExecutionOrder() >= BasePlugin.DEFAULT_CATALOGCALCULATION:
                plug = PluginClass(config, name, metadata=self.plugMetadata)
 
                self._validatePluginCols(plug)
 
                self.plugins[name] = plug
                if plug.plugType == 'single':
                    self.executionDict[executionOrder].single.append(plug)
                elif plug.plugType == 'multi':
                    self.executionDict[executionOrder].multi.append(plug)
            else:
                errorTuple = (PluginClass, PluginClass.getExecutionOrder(),
                              BasePlugin.DEFAULT_CATALOGCALCULATION)
                raise ValueError("{} has an execution order less than the minimum for an catalogCalculation "
                                 "plugin. Value {} : Minimum {}".format(*errorTuple))
 

◆ run()

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.run	(	self,
		diaObjectCat,
		diaSourceCat,
		updatedDiaObjectIds,
		filterNames )

The entry point for the DIA catalog calculation task.

Run method both updates the values in the diaObjectCat and appends
newly created DiaObjects to the catalog. For catalog column names
see the lsst.cat schema definitions for the DiaObject and DiaSource
tables (http://github.com/lsst/cat).

Parameters
----------
diaObjectCat : `pandas.DataFrame`
    DiaObjects to update values of and append new objects to. DataFrame
    should be indexed on "diaObjectId"
diaSourceCat : `pandas.DataFrame`
    DiaSources associated with the DiaObjects in diaObjectCat.
    DataFrame should be indexed on
    `["diaObjectId", "band", "diaSourceId"]`
updatedDiaObjectIds : `numpy.ndarray`
    Integer ids of the DiaObjects to update and create.
filterNames : `list` of `str`
    List of string names of filters to be being processed.

Returns
-------
returnStruct : `lsst.pipe.base.Struct`
    Struct containing:

    ``diaObjectCat``
        Full set of DiaObjects including both un-updated and
        updated/new DiaObjects (`pandas.DataFrame`).
    ``updatedDiaObjects``
        Catalog of DiaObjects  that were updated or created by this
        task (`pandas.DataFrame`).

Reimplemented from lsst.meas.base.catalogCalculation.CatalogCalculationTask.

Definition at line 278 of file diaCalculation.py.

            filterNames):
        """The entry point for the DIA catalog calculation task.
 
        Run method both updates the values in the diaObjectCat and appends
        newly created DiaObjects to the catalog. For catalog column names
        see the lsst.cat schema definitions for the DiaObject and DiaSource
        tables (http://github.com/lsst/cat).
 
        Parameters
        ----------
        diaObjectCat : `pandas.DataFrame`
            DiaObjects to update values of and append new objects to. DataFrame
            should be indexed on "diaObjectId"
        diaSourceCat : `pandas.DataFrame`
            DiaSources associated with the DiaObjects in diaObjectCat.
            DataFrame should be indexed on
            `["diaObjectId", "band", "diaSourceId"]`
        updatedDiaObjectIds : `numpy.ndarray`
            Integer ids of the DiaObjects to update and create.
        filterNames : `list` of `str`
            List of string names of filters to be being processed.
 
        Returns
        -------
        returnStruct : `lsst.pipe.base.Struct`
            Struct containing:
 
            ``diaObjectCat``
                Full set of DiaObjects including both un-updated and
                updated/new DiaObjects (`pandas.DataFrame`).
            ``updatedDiaObjects``
                Catalog of DiaObjects  that were updated or created by this
                task (`pandas.DataFrame`).
        """
        if diaObjectCat.index.name is None:
            diaObjectCat.set_index("diaObjectId", inplace=True, drop=False)
        elif diaObjectCat.index.name != "diaObjectId":
            self.log.warning(
                "Input diaObjectCat is indexed on column(s) incompatible with "
                "this task. Should be indexed on 'diaObjectId'. Trying to set "
                "index regardless")
            diaObjectCat.set_index("diaObjectId", inplace=True, drop=False)
 
        # ``names`` by default is FrozenList([None]) hence we access the first
        # element and test for None.
        if diaSourceCat.index.names[0] is None:
            diaSourceCat.set_index(
                ["diaObjectId", "band", "diaSourceId"],
                inplace=True,
                drop=False)
        elif (diaSourceCat.index.names
              != ["diaObjectId", "band", "diaSourceId"]):
            diaSourceCat.reset_index(inplace=True)
            diaSourceCat.set_index(
                ["diaObjectId", "band", "diaSourceId"],
                inplace=True,
                drop=False)
 
        return self.callCompute(diaObjectCat,
                                diaSourceCat,
                                updatedDiaObjectIds,
                                filterNames)
 

Member Data Documentation

◆ _DefaultName

str lsst.meas.base.diaCalculation.DiaObjectCalculationTask._DefaultName = "diaObjectCalculation"

staticprotected

Definition at line 205 of file diaCalculation.py.

◆ ConfigClass

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.ConfigClass = DiaObjectCalculationConfig

static

Definition at line 204 of file diaCalculation.py.

◆ executionDict

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.executionDict

Definition at line 222 of file diaCalculation.py.

◆ log

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.log

Definition at line 404 of file diaCalculation.py.

◆ outputCols

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.outputCols

Definition at line 213 of file diaCalculation.py.

◆ plugins

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.plugins

Definition at line 212 of file diaCalculation.py.

◆ plugMetadata

lsst.meas.base.diaCalculation.DiaObjectCalculationTask.plugMetadata

Definition at line 211 of file diaCalculation.py.

The documentation for this class was generated from the following file:

/j/snowflake/release/lsstsw/stack/lsst-scipipe-8.0.0/Linux64/meas_base/gf18bd8381d+8d59551888/python/lsst/meas/base/diaCalculation.py

Public Member Functions

Public Attributes

Static Public Attributes

Protected Member Functions

Static Protected Attributes

Detailed Description

Constructor & Destructor Documentation

◆ __init__()

Member Function Documentation

◆ _initialize_dia_object()

◆ _validatePluginCols()

◆ callCompute()

◆ initializePlugins()

◆ run()

Member Data Documentation

◆ _DefaultName

◆ ConfigClass

◆ executionDict

◆ log

◆ outputCols

◆ plugins

◆ plugMetadata

◆ init()